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SOME EXTENSIONS OF THE WISHART DISTRIBUTION’ 


T. W. ANDERSON AND M. A. GirsHICK 


Princeton University and Columbia University 


1. Introduction. The well-known Wishart distribution is the distribution 
_of the variances and covariances of a sample drawn from a multivariate normal 
population assuming that the expected value of each variate remains the same 
' from observation to observation. For problems such as testing collinearity [1], 
> comparing scales of measurement [2], and multiple regression in times series 
' analysis [3], it is desirable to have the distribution of sample variances and co- 
variances for observations, the expected values of which are not all identical. 
' Such a distribution could be considered as a generalization to several variates 
' of the x” (non-central x”) distribution, as well as a generalization of the Wishart 
' distribution to the non-central case. In this paper we shall discuss the general 
| problem of finding the distribution in question and shall derive this distribution 
| for two particular cases. We shall start out with the problem in its most general 
form and as a result of linear transformations express the distribution as a certain 
multiple integral. 
We can think of the expected values of the observations as defining points in a 
' space of dimension equal to the number of variates. If these points lie on a line, 
‘the non-central Wishart distribution is essentially the Wishart distribution 
multiplied by a Bessel function; if the points lie in a plane, it is a Wishart distri- 
| bution multiplied by an infinite series of Bessel functions. For higher dimen- 
| sionality the integration of the multiple integral becomes extremely troublesome; 
) it has not been possible yet to express the general integration in a concise form. 
| These results are summarized precisely at the end of the paper. 


2. Reduction to canonical form. Consider a set of N multivariate normal 
populations each of p variates. Let the 7th (¢ = 1, 2, --- , p) variate of the ath 
(a = 1,2, --- , N) population be z;. ; let the mean of this variate be 

- (1) E(ia) = Mia @ =1,2,---,pja=1,2,---,N); 
| and let the variance-covariance matrix (of rank p) common to all N distributions 
_ be 

, | El(tia — pia) (tia — mia)] |] = || ois || (@ = 1,2,---,N). 
; Now consider a sample of observations {2z;.} one from each population. 

The purpose of this paper is to find the joint distribution of the quantities 


(2) ai = x (tia — 2) (tye — 25), 


1The results given below were arrived at independently by the two authors. Some 
preliminary results were given before the Institute of Mathematical Statisties at Wash- 
| ington, D. C., May 6, 1944, by Girshick. 
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1 N 
y= 7 Lo tia. 


a=l 


To simplify the notation in the subsequent work we treat the quantities a;; 
instead of the sample variances and covariances which are simply multiples 
(by 1/(N — 1)) of the a;;. 

The a;; may be considered as sums of squares and cross products, for there 
exists a linear transformation’. 


N 
Lia a 2 Gag Lig (z _ 1, 2, eer DP), 


where the matrix || @.s || is orthogonal (and @y1 = Ove = --- = Ovw = 1/+/N), 
such that 


n 
, , 
a= Zz Liatja y 


a=l 
where n = N — 1 and 
NE:Z; = Lin Xin ‘ 
For a given a the z;. have a multivariate normal distribution with the same 
variances and covariances as the x’s and with expected values 


N 
, 
E(xia) = 2 Oaphig = Mia, Say. 


n 
2 
“as = a Miabja + 
a= 


Then it is clear that the 7;; are the same functions of the y’s that the aj; are of 
the x’s, namely, 


N 
(3) = 2d (uie — fs) (Usa — Bj), 
where 


| N 
i= — X Mia. 
Now consider the two p by p matrices 
z= || 04; || 


and 
= || 74; ||. 


2 See, for example, [4]. 
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Let ki, K2, °** , Kp be the real, non-negative roots of the determinantal equation 
(4) |T—aAzZ| = 0. 

There exists a non-singular p by p matrix 

(5) Vv = || vii || 

such that’ 

(6) Vrv’ =I 


> 


| 
2 
0 0 eee > | 
where J is the identity matrix and W’ is the transpose of ¥. Suppose the rank 
of T is t; then ¢ of the roots are non-zero and p — ¢ are zero. For the sake of 
convenience we shall choose xi, kz, °*~ , Kk; to be the non-zero roots. If T is of 


rank ¢t, then the means ya lie in a ¢ dimensional sub-space of the original p 
dimensional space. Let us make the transformation 


p 
(8) Zia = 2d Wij Zia ‘ 
7= 


The z;. are normally and, because of relationship (6), independently distributed 
with unit variances. The mean value of Zi. is 


Pp 
E(Zia) = 2d, Vijtia = Via, 
7= 


say. Asa result of (7) 


(9) eS tad 


a= 
(10) 2 are = 0 
Let the new sum of squares of cross-products be 
(11) bi; = - Zia Zia - 


We shall first find the joint distribution of the b;; and then obtain the distribution 
of the a;; by using the fact that the b;; can be considered simply as a linear 


* See, for example, [5]. 
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transformation of the a;; in a }p(p + 1) dimensional space. For we can write 
b;; as 


rn —p p 
(12) bs; = z Vin Wik Dra tie = Dy Win Wir One « 


a=1 h,k=1 h,k=1 


The transformation (8) is performed on all variates of each observation. The 
next transformation, which is the one that results in the canonical form of the 
problem, is performed on all observations of each variate. We wish to construct 
the n by n matrix of this transformation 


® = || das || 
in the following manner: Let 


(a 
In view of (9) and (10) 
LD bredee = be (n= 1,2,--,8 
where 6;, is the Kronecker delta. The remaining elements in ® are chosen in 


any way to make & orthogonal. 
Now make the transformation 


Yia = = ap 21g 
B=1 


Because © is orthogonal, 
(13) bis = Lo YiaYsa> 


and the y’s are independently normally distributed. By virtue of the construc- 
tion of & and the properties of || via || the expected value of each y;. is zero except 
for ¢ of the variates, namely, 


E(Ym) = ky (n = 1,2, ---,2), 


Now the problem can be put in this form: Find the distribution of b;; (given by 
(13)) when the distribution of the y’s (in the canonical form) is 


P n 
\ —D 2D (via—es5¢a)? 
t=1 a=1 


(Qr)Pn/2 ' . 


where x, k2, °°* , «, are different from zero. 
The solution of our problem can be expressed as a certain multiple integral of 
tp variables. Let 


n 


bi; = - Yia Yja - 
a 


=t+l 
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Since the b;; have the Wishart distribution with n — t degrees of freedom (we 
assume n > ¢ -+ p) we can write the joint distribution of the b;; and y, (i = 
1, 2, ‘** = 1, 2, lai , t) as 


Pp 


1 2 bi; 


cc ra | bi, | i(n—p—t—1) i,j=1 
ip(n—t)_tp(p—1) Il 1 “ - 
2 Tv rain —t+ 1 —- 14)) 

i=1 


1 = = 2 (Ueq—endin)? 
$= —/ 
x (Qn) Ptre e 


Considering the equations 
t 
bis = bis + Do Yan 
n= 


as a transformation of the b;; we immediately obtain the joint distribution of the 
b’s and the y;,(2 = 1,2, ---, p37 = 1,2,---,2) as 


t 
—$ D «? 


.o 


t 
| bis — 2» nmi 
7_= 


Sate ea ae ee ee ee 
(14) gin tp(p—l) tint II r(4[n mth i- a) 
t=1 


- 5 bget > KaVan 
wer lr ; 

To find the distribution of the b;; we must integrate out the y;, , where the 
range of integration is such that the matrix 


t 
|| bis — 2, Yin Yin || 


is positive. For ¢ = 1 or 2 we can integrate (14) and express the results in a 
convenient form. However, for higher values of ¢ the integration affords con- 
siderable difficulty and has not been done for the general case. In terms of 
geometry the case t = 1 is the case in which the expected values of the observa- 
tions lie on a line in the p dimensional space. In the case of ¢ = 2, similarly the 
expected values lie in a plane in this space. Hence, we shall call these two cases 
the linear and planar cases, respectively. 


3. The linear case. In the linear case there is one root of the equation (4) 
which is not equal to zero, that is, there is simply one « in the distribution (14) 
and one set of y’s, namely yu(t = 1,2,---,p). The problem is to integrate the 
Ya Over the range for which the matrix 


| bis — yaya || 
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is positive; the integrand we are interested in is (dropping the subscript “1” 
from the x; and y, and neglecting the part not involving the y’s) 


Pp 
(15) bis — yey PO? Me™ TT dy: . 
t=1 


The determinant in expression (15) can be expressed as 

E os 

bis — ysys| = bas l(1 - xX by; uw); 
i,j= 
where 
|| bY |] = |] bes I. 
The inverse exists because the probability is zero that || b.; || is singular. There 
is a linear transformation 
Pp 


a bs Gij Uj » 


j=1 


such that 
Pp ae Pp 2 

Db’ ysys = DE Uj 

t,j=1 j=1 
and 

Kyi = lw, 

where /’ is the one non-zero root of the equation: 
' (16) | ABT — Ey | = 0, 


where B* = || b” || and Ey is the matrix with unity in the upper left hand corner 
and zeros elsewhere. ‘This fact is a result of the well-known theorem concerning 
diagonalization of pairs of quadratic forms.’ The Jacobian of this transformation 
is 


lass | = | bes |’, 


Pp 
and the range of integration is >> uj < 1. The integrand is transformed into 


t=1 


Pp 

—p—1) 2 

| be; [4 (1 “ 2d ui 
= 


}(n—p—2) 


Pp 
eM II du; . 


i=l 
Now let 
uw, = sin wv, 
Ui = COS W 14-1 (¢ = 2,3, ---,p) 


‘The transformation is the so-called ‘‘regression transformation.’’ See Madow [6]. 
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The Jacobian of this transformation is cos” w and 
Pp . p—l 
1- > u= cos! w (1 — dr; 
t=1 t=] 
The integration is over the ranges of ” < w < > and 
p—1 
4% <1. 
t=1 
We integrate the following expression 
p—l . 3(n—p—2) p—l ; 
ini?" [(: — > 2 II av {cos”” we’ "” dw}. 


t=1 t=1 


The integral of the quantity within the brackets is simply a Dirichlet integral [7] 
and its value is 


Pale — pl)? 
rain — 1) © 


The integral of the expression within the braces is a multiple of a Bessel function 
of purely imaginary argument [8, p. 79]; that is, 


CO Te Tue-00. 


Hence, the integral of (15) is 
(17) | bi; MP in — p))e?(/2) 3 Tyn_m (I). 


Multiplying equation (16) by the determinant |B| one can easily show that the 
non-zero root, I’, is simply «bi. The distribution of the b;; in the linear case 
then is 


p 
—hx2 -—3 D> d;; 
ee 


ee) Re aa 
gipn—i(n—2)_ tp(p—1) I] r(3[n aie i)) 
Iw 


| bss | 3° (Pd) Tenn (KV Bun) 


In §5 we shall give the distribution in terms of the original variables, namely, 
the Qi;. 


4. The planar case. The case of two non-zero roots of equation (4) can be 
handled by continuing the process of integration of §3 another step. The 
essential problem is the integration of 


2 


2 Pp 
(18) | bis = Z Yin Yin | sili: ceases Il II dyin 
q=1 


t=1 7=1 
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over the range of y’s for which the matrix 


2 
|| bs5 — a Yin Yin || 
n= 
is positive. The integration is done in two stages, first with respect to the yj, 
then with respect to the yz. Letting 
biz = dig — Yoidrs , 
=n-— 1, 
K 
Gi 
and omitting for the time being 
Pp 
(19) eX2u22 Il dys , 
i=l 
we can write the first stage of the integration of (18) as 


- 4Gi—p—2) ti TT oe 
[B55 — ges | me TT dg 


over the range || b:; — 9; || positive. But, the only difference between this and 
the integration of (15) which has been shown to be (17), is that we are now writing 
all variables with ““~” signs. Keeping this in mind, changing back again to 
our other variables of Section 4 and inserting again (19) we ean write the first 
stage of the integration of (18) as 


2 2 —3(n—3) 
| 4(n—p—2) Kove E oe - ” 


T(3in — p —- 1])x*” | bis — Yar Y;e | € 


4 
(20) 


pent i ar a ee Pp 
‘Tyo-9(W ibn — iia) LT dye. 


Now we must integrate (20) with respect to the yi over the range || bi; — yay; || 
positive. The determinant in (20) can be written as 
| bi; | ( _ [ b” yin ve) . 
There is a transformation 
(21) Y2 = > 95585 ; 
such that 


Pp 


7 D 
> bo yeye = a, sj, 
I= 


t,j=l 


(22) 


2 2 2.2 
KiYi2 =fsi, 


Ko Yoo = 18; + d282 ’ 
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where f’ is the one non-zero root of the equation 
|AB — GEu| = 0, 


where B" and Ey are used as in equation (16). Since this equation 
is similar to (14), then f? = xiby. The values of d; and d, will be considered 
later. This result is deduced from an extension of the theorem concerning the 
diagonalization of pairs of quadratic forms.” The Jacobian is 


Loss| = [des |, 
p 
and the range of integration is >) 8; < 1. The integrand (20) is now changed to 
t=—1 
4(n—p—2) 


Pp 
rin — p — Ix’? |b |*? ( -Ds efter tdaes 


t=1 


fl a 81) —}(n—3) P 
{ a Tyn—s) ($21 — 82) I] ds; . 
Next the following transformation is made: 
S, = sin mw, 
S. = COS UW; SIN We , 
8; = COS Wi COS Wi-2 
The Jacobian is cos” w; cos” ‘we , and 
p—2 
1 — >) s? = cos’ w, cos’ w.( 1 — >, a), 
tal 


We now integrate 


p—2 4(n—p—2) p—2 
rin — p — Ux” | dy |" | 1-¥ a) O an | 


t=_1 


. —i(n—3) 
(23) cos" Wi cos” * We gfsstnur tee cos w, sinwe ( cos w) 


-Iyn—s)(f cos wi) due aun) ‘ 


The integral of the expression within the square brackets is another Dirichlet 
integral; its value is 


rG{n — pl)? 
Tain — 2}) 


‘ Again the “regression transformation’”’ is used. See footnote for Section 3. 
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Similar to Section 3 the integration of the quantity within the braces with respegt 
to we is 


, \—i(n—8) 
(gin as 2))-V x (hone nea _ I n—s) (do cos W) 


. adil f —}(n—3) 
a sinw 
‘cos w,e"! : 5 COS Wr Ti(n-s(f cos w,), 


Since the range of integration of w, is — 7/2 < w; < 2/2 and since sin ¥; is an 
odd function and cos w is an even function, the integral of the above expression 
can be transformed into an integration over the range 0 < w; < 2/2 by replacing 
e712 "1 by 2 sinh (d; sin w:). In view of the relationship between the Bessel 
functions of purely imaginary argument and sinh (d; sin w;) [8, p. 54] we can write 
the integral of the above expression as 


sin — 2va(%) "Vai Vix 


x tr I_3(d, sin w1)I4¢n—s) (de €08 W1)Iicn—zy(f cos wi) sin $ w; Cos W; du, . 
0 


This integral can be expressed in another form by virtue of a formula in Watson’s 
Bessel Functions |[8, p. 377] as 





Ja Qo Tino + d3 + - _ Qdof cos u) + .n-3 


a —— “wae. 


0 (dj + a + f ~~ Qdof cx cos uy)? 


Letting dj + d; + f’ = xand dof = y and using an expresion formula for Bessel 
functions [8, p. 140] we can write (24) as 


7 
o/s #"™ i Z (—y ee cos’ u 47 Ty yay (4/2) sin” u du. 


Since the integral of cos’ u sin” *u where y is odd is zero, the result of the integra- 
tion (using the ‘‘duplication formula” for I functions and letting y = 2w) is 
(y? )°a —}(n—2)+2w 


T(aln — 2))x2*" 2d, I (n-2) 420 (02). 
™ “ait 5 ) 
From the relationship (22) it is clear that the equation 
(25) | Kids; — AD? | =O (Ki = Ofori = 3,4,---, 
is transformed by (21) into — 


fi +di—»r did» 0 
| dide &- xr 0 


(24) 


0 0 
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Expression (25) is equivalent to 
(27) | kik jb; = di; | = 0. 


Hence, x and y’, which are the sum and product, respectively, of the non-zero 
roots of the two equivalent equations (26) and (27), are given by 


z= fi+dit dj = xibu + xibe, 
y? = fds = Kinz(bube — bi). 
. In view of this result we can now write the integration of (23) as 


@ 


< 2.2 2 
ridin — p — ADE — ple? BO |byy|MOP x Dy een — Pan 
Pea 2” oIT > 4. ») 
(Ki Dur + 3a) 82? 7 Hin) 420(Wir buy + x2 Dap) 


Finally, by multiplying in what was left out of (18) we obtain the integral of (14) 
which is the solution to the problem as stated in the canonical form: 


P 
oa SS bes 
e ite | bi; [io—e-» e a" 





p—2 
(28) gien—h(n—2) Pe) IT r(3{n ah - i]) 


x . [ki x2(Dir bos — bi2)]° (bu + «3 Bos) 240-9 +209 


oI T (* 7% : + A) 


X Tyn—2420(W 2 bu + 2 bes). 


§. Final form. To answer the problem as stated originally it is necessary to 
make the transformation (12) and obtain the distribution in terms of the a;; for 
the linear and planar cases. 

It is clear that equation (25) is equivalent to 


(29) |T —rA*| =0, 


where A™* = || a;; ||’, for T is the transform (by (5)) of || «i6;; || and A~ is the 
transform of || b;; ||. The sum and product of the non-zero roots, which are the 
arguments of the infinite series in (24), remain unchanged. 


Pp 
Since the quantity >> «; is the sum of the roots of (4) it can be expressed as 
t=1 
Pp . Pp N i 
2» a e 2, a a” (uia = Hi) (ja - iii), 


where || o*’ || = || o;; ||". Furthermore we have 


[bis | = |Z] - | as; | 
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Xba = tr [Bes || = te (Pll aes IW) = Ye May. 


Moreover, the Jacobian of the transformation (12) is* 
(30) | J | io |v po ons | > eer 


Hence, we have the following results: 

Given N multivariate normal populations each of p variates with identical variance- 
covariance matrices || o;; || and with expected values of the pN variates Xia given 
by (1). Let a;; be defined by equation (2); let the rank of the matrix || 7:; || defined 
by (3) be t. 

(i) Whent = 0, the joint distribution of the a;; is given by the Wishart distribution, 

(ii) When t = 1, the joint distribution of the ai; is 

N 


Pp 
—§ DF ei ia—hi)(uja-hj) 
2? —] —p— 
e timla=1 | of oe | as; pom p—2) 








at 


Pp 
gip(N—-1)—-H(N—8) rir) : r(3[N —— 1\) 


t=1 


4 > otiag; 2 £ a 
X e tint | 2 Dd Gij(uia — Hi) (uja — x | 


t,j=1 aml 


Pp N 


X Lyw-s) ( DD ai(uia — Fi) (Mia — is) 


4,j=1 a=i 
(iii) When t = 2, the joint distribution of the a;; is given by 
Pp N 


—§ DL efi (uga—Bi) (uja—8j) Sg, 
a acl —p—2 
e@ timlamt | o - ) | a5; po p—2) 


p—2 
gir(N-D)—-HN-8) | tp(p—1) II r(3(N — 2 — 4) 


t=1 





Pp 
—t Z ctiag; = Uy Us)” 
Xe iin x _ ee (uy + 145) Tr 
- mtr ( — + v) 


x Tw) 420(V 1 + w), 


where u; and uz are the two non-zero roots of (29). 
(iv) When t > 2, the joint distribution of the a;; can be written by means of 


expression (14) as a multiple integral. The explicit form of the distribution has not 
yet been obtained. 


6 One method of demonstrating this fact is to apply (8) to centrally distributed variates 
and compare the Wishart distribution of the transformed variates with the Wishart dis- 
tribution of the original variates. 
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STATISTICAL TESTS BASED ON PERMUTATIONS OF THE 
OBSERVATIONS 


A. WALD AND J. WoLFOWwITz 


Columbia University 


1. Introduction. One of the problems of statistical inference is to devise exact 
tests of significance when the form of the underlying probability distribution js 
unknown. The idea of a general method of dealing with this problem originated 
with R. A. Fisher [13, 14]. The essential feature of this method is that a certain 
set of permutations of the observations is considered, having the property that 
each permutation is equally likely under the hypothesis to be tested. Thus, an 
exact test on the level of significance a can be constructed by choosing a propor- 
tion a of the permutations as critical region. In an interesting paper H. Scheffé 
[2] has shown that for a general class of problems this is the only possible method 
of constructing exact tests of significance. 

Tests based on permutations of the observations have been proposed and 
studied by R. A. Fisher, E. J. G. Pitman, B. L. Welch, the present authors, and 
others. Pitman and Welch derived the first few moments of the statistics used 
in their test procedures. However, it is desirable to derive at least the limiting 
distributions of these statistics and make it practicable to carry out tests of 
significance when the sample is large. Such a large sample distribution was 
derived for a statistic considered elsewhere [1] by the present authors. 

In this paper a general theorem on the limiting distribution of linear forms in 
the universe of permutations of the observations is derived. As an application 
of this theorem, the limiting distributions of the rank correlation coefficient and 
that of several statistics considered by Pitman and Welch, are obtained. In the 
last section the limiting distribution of Hotelling’s generalized T in the universe 
of permutations of the observations is derived. 


2. Atheorem on linear forms. Let Hy = (hi,h2...,hw) (N =1,2,...,ad 
inf.) be sequences of real numbers and let 


gitij « w* Ft (i. —~ Nn i ln) 


a=1 


for all integral values of r. We define the following symbols in the customary 
manner: For any function f(N) and any positive function g(N) let f(N) = 
O(y(N)) mean that | f(N)/¢(N) | is bounded from above for all N and let 


F(N) = 2¢()) 


mean that 
f(N) = O@W)) 


limy inf | f(N)/e(N) | > 0. 
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and that 
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Also let 
f(N) = o((N)) 


mean that 
lim F(N)/¢e(N) = 0. 


Let [p] denote the largest integer < p. 
We shall say that the sequences Hy(N = 1, 2, --- , ad inf.) satisfy the condition 
W if, for all integral r > 2, 


r(Hy) 
2.1 uly) _ oy, 
For any value of N let 
X = (x1, —— , Ly) 


be a chance variable whose domain of definition is made up of the NV! permutations 
of the elements of the sequence Ay = (aq, @2,--+, ay). (If two of the a(t = 
1,2, --- , N) are identical we assume that some distinguishing index is attached 
to each so that they can then be regarded as distinct and so that there still are N! 
permutations of the elements ai, --- , ay). Let each permutation of Ay have 
the same probability (N!)*. Let E(Y) and o(Y) denote, respectively, the 
expectation and variance of any chance variable Y. 

We now prove the following: 

THEOREM. Let the sequences Ay = (a1, d2, +++, dv) and Dy = (d,, d2, --- , dw) 
(VN = 1, 2,---, ad inf.) satisfy the condition W. Let the chance variable Ly 
be defined as 


- 
Ly = > dja. 
t=1 


Then as N — ~, the probability of the inequality 
Ly — E(Ly) <to (Ly) 


for any real t, approaches 


1 ' 2 
—4x 
7 dz. 
V/ 24 , : 
For convenience the proof will be divided into several lemmas. 
Since 


Ly — E(Ly) 
o(Ly) 
remains invariant if a constant is added to all the elements of Dy or of Ay, or 


if the elements of either of the latter are multiplied by any constant other than 
° ° / 
zero, We may, in the formation of Lx, replace Ay and Dy by the sequences Ay 


Ly 















360 A. WALD AND J. WOLFOWITZ 








and Dy , respectively, whose ith elements a; and d;(i = 1,2, --- , N) are, respec- 
tively 
N 
(2.2) a; = [u2(A,)]* («a —-N“*D “,) 
j=l 
and 
N 
(23) di = (Dn) (ae - Yas). 
Po 
The sequences Ay and Dy satisfy the condition W. Furthermore, 
(2.4) (Ay) = m(Dy) = 0 
and 
(2.5) u2(A w) = u2(Dy) =1. 
LEMMa 1, 
(2.6) Qn ,4a, °** Gg, = O(N) 


ay<ae<css*<agsNn 


(2.7) 2 °° Dy da,day ++ de, = O(N™"). 

@1<ag<ess<aeSNn 
From (2.4), (2.5), and the fact that the A), and Dy satisfy condition W, it follows 
that the Ay and Dy satisfy conditions a), b), and c) of the theorem on page 383 
of [1]. Our lemma 1 is the same as lemma 1 of [1]. 
LemMA 2. Let 
V = (%1,%,*** , Un) 


be the same permutation of the elements of Any that X is of the elements of Ay. Let 


y =v, +++ ve wherez = vida» -°+ Van »ty > 1 (7 = 1,2, ---, 7), andk,7, v1, °°, 
1, are fixed values independent of N. 

Then 

(2.8) E(y) = O(N*?I-*), 


This is Lemma 2 of [1]. 
In a similar manner we obtain that 


Ze oe 2 da, a Chote ree a 
(2.9) A prBaere A (k+r) 
° ai oww***) -O(N**") ais o(n 21+), 
The summation in the above formula is to be taken over all possible sets of k +r 
distinct positive integers < N. 
LemMMA 3. Let a, --+, air be (k + r) distinct positive integers < N. Then 
(2.10) E(uiv2 + +> Ved e+) oe Vik+r)) = E(va,Va, *** Vay a iii Gewnd 


This follows from the fact that all permutations of Ay have the same probability. 
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LEMMA 4. Let 


N 
4 , 
Ly = diy. 
i=l 


Then 
(2.11) E(L}’) = O(N)?"4), 


Proor: Expand L,/ and take the expected value of the individual terms. The 
contribution to E(L,”) of all the terms which are multiples of the type appearing 
in the right member of (2.10) with fixed k, r, 11, ---, i(k +i: + --- +7, =p), 
is, by Lemmas 2 and 3 

on™) > .- > da, Bos ae roe =... a O(N *?!-*) Q( Nn #21+7) 


@i»° 7a (k+ r) 
all aiesens 


on O(ns™***) : 


Since 7; > 149 = 1, --- , r), it follows from the fact thatk +72+---+%=p 
that 2r < p — kand that 2r = p — k only ifi, = --- = 7, = 2. Now 


k p—k p 
EC — < 
(2.12) 2[* k+r<erc< ae $5: 


Hence the maximum value of 2 H — k+risreached when r = [2 | and k = 0. 


This proves the lemma. 
From the last remarks of the preceding paragraph we obtain 
LemMaA 5. 


@.13) — B¢ua) — SBS ..- Dad «++ dd )EGH --- of) = of). 


all different 


We now prove 
LEMMA 6. 


(2.14) E(Lx) = 0 
(2.15) E(Ly) = NE(vj) + o(N) = N + O(N). 


Equation (2.14) follows from (2.2). Consider the expectations of the various 
terms in the expansion of Ly. The sum of all the terms of the type 


did;E (vw;) 


(2 d;d;)E(v1v2) = O(N)O(N~) = O(1), 


by Lemmas 1 and 2. The sum of all the terms of the type 
d;’E (vi) 
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N 
(> i’) E(v;) = NE(vj) = N, 


by (2.2) and (2.3). This proves the lemma. 
LEMMA 7. 


(2.16) Evi +++ vj) = 1+ 0 (1) 
(2.17) z wad Z. ° od wake da, - N? 4 o(N’). 


@1* +a; 
all different 


From (2.2) and (2.3), and Lemma 3, it follows that it will be sufficient to 
prove (2.17), because (2.16) follows in the same manner. Consider the relation 


; . / j / / 
N’ = (> i?) = 7 --- de, tee cc. + other terms. 
all differest 
By (2.9) the sum of these other terms must be not larger than O(N’). From 
this follows the lemma. 
ProoF of the theorem: Since 
* Ly _ Ly — E(Ly) 


= 


~ @i) 


| 


it will be sufficient to show that the moments of Ly approach those of the normal 
distribution as N — «. From (2.14), (2.15), and (2.11) we see that, when p is 
odd, the pth moment of Ly is O(N’) and hence approaches zero as N = «. 
When p is even and = 2s (say), it follows from Lemma 5 that 


128 2s)! 12 12 2 2 8 
Ris a = +m cee z Ga, wae da) E(v; eee v3) = o(N ). 
: a1 


all different 
Hence from (2.16) and (2.17) 
(2s)! 
s!2° 


From (2.18) and (2.15) we obtain that 


(2.18) E(Ly") = N* + o(N’). 


sa *28 (2s)! 
a —?* s!2s° 
This completes the proof of the theorem. 

It will be noticed that nothing in the foregoing proof requires that, when 
N < N’, the sequences Ay and Dy be subsequences of Ay and Dy. Indeed, 
the sequences were written as they were simply for typographic brevity. We 
have therefore 


















to 
ion 


om 


mal 
p is 
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CoROLLARY 1. The theorem is valid for sequences 
Aw _ (am, ens » Gyn) 
Dy _ (dm, — , nn) 
(N = 1, 2, --- ad inf.) 
provided they fulfill condition W. 

CoroLuary 2. If the elements a,(i = 1, 2,--- ad inf.) are all independent 
observations on the same chance variable, all of whose moments are finite and whose 
variance is positive, the sequences Ay(N = 1, 2, --- , ad inf.) will fulfill condition W 
with probability one. 

3. The rank correlation coefficient. For this well known statistic (see [3]) 

An = Dy = (1, 2, 3, sities ,N). 
The sequences Ay and Dy satisfy the condition W. For 


N 
z. i” — O(N**?) 
t=1 


and hence, for r > 3 


ur(A w) ur(Dy) — O(N’). 















Also 





u2(A w) _ u2(Dy) = Q(N’). 


Hence the distribution of the rank correlation coefficient is asymptotically 
normal in the case of statistical independence. This result was first proved by 
Hotelling and Pabst [3]. 





4. Pitman’s test for dependence between two variates. The distribution 
of the correlation coefficient in the population of permutations of the observations 
was used by Pitman [4] in a test for dependence between two variates which 
“involves no assumptions” about the distributions of these variates. In our 
notation, let 


(a;, da:)(@ = 1, 2, ---, N) 


be N observations on the pair of variates A and D whose dependence it is desired 
to test. Then the value of the correlation coefficient is 


N 
ND) dia;. 


t=1 






At the level 8 the observations are considered to be significant if the probability 
that N~* | Ly | be equal to or greater than the absolute value of the actually 
observed correlation coefficient is < 6. 
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In his paper ([4], page 227] Pitman points out that if the ratios of certain 
sample cumulants are ‘‘not too large,” then, as N — o~, the first four moments 
of N~' Ly will approach 0, 1, 0, and 3, respectively (the first moment is always 
zero). Our theorem and the relation (2.15) make clear that under proper 
circumstances all the moments will approach those of the normal distribution. 


5. Pitman’s procedure for testing the hypothesis that two samples are from 
the same population. For testing the hypothesis that two samples came from 
the same population Pitman [5] proposed the following procedure: 

Let one sample be 
Q,G2,°°* » am 
and the other 


Am+1, Im42,°°* » Amin- 


Write m + n = N, and construct the sequences Ay and Ay as before defined. 
Let 
d; = 1 (¢ = 1, ---,m) 


d; = 0 G@=m+1,---,N) 


and construct the sequences Dy and Dy. Then the value of the statistic con- 
sidered by Pitman is, except for a constant factor, 


(5.1) Nn (> d; a) 


At the level 8 the observations are considered significant if the probability that 
N~* | Ly | be equal to or greater than the observed absolute value of the expres- 
sion (5.1) is < 8. 


a ‘ : 
Let N — ~~, while — is constant. Then the sequences Dy are seen to satisfy 
n 


condition W. If then the sequences Ay satisfy condition W we may, for large N, 
employ the result of our theorem and expeditiously determine the critical value 
of Pitman’s statistic. 


6. Analysis of variance in randomized blocks. Welch [7] and Pitman [6] 
consider the following problem: Each of n different ‘varieties of a plant” is 
planted in one of the n cells which constitute a “block.” It is desired to test, 
on the basis of results from m blocks, the null hypothesis that there is no difference 
among the varieties. In order to eliminate a possible bias caused by variations 
in fertility among the cells of a block, the varieties are assigned at random to the 
cells of a block. If the cells of the jth block are designated by (71), (72), -:-, 
(jn), a permutation of the integers 1, 2, --- , m is allocated to the jth block by a 
chance process, each permutation having the same probability (n!)~. 

Let x; be the yield of the ith variety in the kth cell of the jth block to which 
it was assigned by the randomization process. It is assumed that 


Lise = Yin + Os + Exe, 
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where ¥/;x is the “‘effect’”’ of the kth cell in the jth block, 6; is the “‘effect’”’ of the ith 
variety, and e;, are chance variables about whose distribution we assume nothing. 
The null hypothesis states that 


6, = & = --- = § = 0. 


Let aj, be the yield in the kth cell of the jth block and 2;; the yield of the ith 
variety in the jth block. If the null hypothesis is true then, because of the 
randomization within each block described above, the conditional probability 
that, given the set {a,.}(k = 1, 2, ---, 7), the sequence 2;, 22; +--+, nj, be any 
given permutation of the elements of {aj} is (n!)"*. Permuting in all the 
blocks simultaneously we have that, under the null hypothesis, given the set of 
mn values {aj} (j = 1, 2,---m;k = 1, 2, ---,), the conditional probability 
of any of the permutations is the same, {n!)”. This permits an exact test of 
the null hypothesis. 

The classical analysis of variance statistic that would be employed in the 
conventional two-way classification with independent normally distributed 
observations is 


r= (m — 1)m >> (2;. — 2)? 
Do dy (ti3 — By. — By + 2)’ 


ty. = m* > Vij 
7 
0.5 = n p Liz 
The statistic W used by Welch and Pitman is 


W = F(m—-1+ Fy". 


Since W is a monotonic function of F and the critical regions are the upper tails, 
the two tests are equivalent. The distribution of F or W is to be determined in 
the same manner as that of the other statistics discussed in this paper, i.e., over 
the equally probable permutations of the values actually observed. The critical 
region is, as usual, the upper tail. 

Since x;; takes any of the values aj, ---, @jn with probability 1/n, we have 


(6.1) E(z;;) = n* x Gj, = a; (say). 


t= (mn)~ z= >. Vij. 


(6.2) a(x) =n 2X (aj, — aj)” = b; (say). 


o(25,; 25) = n(n — DI” »» Ojk, jeg — Oj 
[n(n — YT an) — X ain] — a; 
[n?a; — » ajx| [n(n — 1)]7* — aj 


(n — 1)* [a5 — n™* x an] = —(n — 1)*D;. 
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E(2;.) = m™ >> a;. 
o*(2;.) = m* 7. b; = b (say). 
o(x;,. 2%.) = —[m?(n — 1)]* 5b; =e (say). 


lh ~ 


55 = x Nive; 
where |] Ai» || is an orthogonal matrix and 
oe eee 
Then it follows that 
E(zx;.) = 0 
(6.7) o(z;.) =b—e (ij = 1,2,--- 
o(x;,. 3.) = 0 (i: ¥~ t25%1,% = 1,--- 


Furthermore, we have 


n—1 


(6.8) a uo = >. (x;. — x)’. 


Applying the well known identity 
LU(ai; — zi. — 2. + 2) = TU(ai; — z.;)? — mB(z;. — 2)’ 
to the definitions of F and W we obtain 
™m dX (1;. — x)’ 
vi X X (xey — 2.5)" 


The denominator of the right member of (6.9) is invariant under permutations 
within each block and equals 


X X (aj — a;)” = (n — 1)m*(b — ¢). 


(6.9) 


Hence 


W = [m(n — 1)(6 — o)]~ > (2;. — x)” 
(6.10) a 


= [m(n — 1)(b — o)]” te. 


If the joint distribution of the x;.(i = 1, 2, --- , m — 1) over the set of admissible 
permutations approaches a normal distribution with non-singular correlation 





PERMUTATIONS OF THE OBSERVATIONS 367 


matrix as m, the number of blocks, becomes large,.it follows from (6.7) and (6.10) 
that the distribution of m (n — 1) W approaches the 2’ distribution with n — 1 
degrees of freedom. Hence it remains to indicate conditions on the set {a,x} 
which would make the distribution of the 27. approach normality. Each a3. is 
the mean of independent variables, so these conditions need not be very 
restrictive. 

According to Cramér [8], Theorem 21a, page 113, if the variances and co- 
variances fulfill certain requirements (the limiting correlation matrix should also 
be non-singular) and if a generalized Lindeberg condition holds, normality in the 
limit will follow. Somewhat more restrictive conditions which are simpler to 
state and which will be satisfied in most statistical applications are that 0 < c’ < 
b; < c” for all 7, where c’ and c” are positive constants. Since the variance of 
tis is (n — 1)’nb; , it can be seen that the above inequalities imply the fulfillment 
of the conditions of the Laplace-Liapounoff theorem {see, for example, Uspensky 
[9], page 318). By [6.7] the correlation matrix is non-singular. 


7. Hotelling’s generalized 7’ for permutation of the observations. In this 
section we shall restrict ourselves to bivariate populations, the extension to more 
than two variables being straightforward. Let (ui, U1), --+, (Uim, Uem) be m 
pairs of observations on the chance variables U;, U2, and (ui¢miy , U2(m4t)), °°" 5 
(uw, Ue), be n pairs of observations on the chance variables U; , U2, where 
m+n = N. If each of the pairs Ui, U2, and U;, U2 is jointly normally 
distributed with the same convariance matrix, the Hotelling generalized T for 
testing the null hypothesis that 


(7.1) E(U,) = E(U;) 


and 


(7.2) E(U:) = E(U)), 
is given (Hotelling [10]) by 


2 2 


T? = N‘(mn) 2d 2d qis(%; —U;) (a; — Hj) 


7 1 


and the matrix || q:; || is the inverse of the matrix || b;; || with b;; given by 


™m N 

(N — 2)b;; = dX (us: — Us)(ujy — @j) + 2, (us — W;)(ujy — @}). 
In Hotelling’s procedure the 6;; are sample estimates of the population co- 
variances whose distribution is independent of that of the sample means. A 
constant multiple of the statistic T”’ has the analysis of variance distribution 
under the null hypothesis. If the population covariances were known and used 
in place of the b;; , T’ would have the x’ distribution with two degrees of freedom. 
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Let us now apply the generalized T over the permutations of the actually 
observed values, as was done with other statistics in previous sections. If we 
do this literally we will find that the b;; are no longer independent of the sample 
means. To avoid this complication we shall use a slightly different statistic 7’ 
which, as will be shown later, is a monotonic function of 7’, so that the test based 
on 7” is identical with that based on 7’. The statistic T’ is defined as follows: 
Let 


N 
U; = N" > wn 
k=l 


Ci; = N((N - Uma}? (Use — U,) (uj ro 0;) (z, j, = 1, 2) 


and 
, , _ 
Il ges || = [ees I 
Then 
3 : / , , 
(7.3) 7 = » 2» qis(U; — U;) (a; = uj). 
t=] j= 
The expression 7” is much simpler than 7”, since the coefficients qi; are 
constants in the population of permutations of the observations. We shall 
show that 7” is a monotonic function of T”. Let 
N 


Qi; = : (ui — U)(ujy — %) + DD (we — T)(up — 4) 


Q; = . (uz — U,)(ujy — U;) 
QO" Il = 1] Qe5 117 
1’ || = |] Qs |. 


Then the expressions 
2 2 


(7.4) Ti = DL Ma; — a); — 4) 


j=l j=l 
and 
2 2 


(7.5) T: = > O(a; — 4%); — 4), 


tol jal 


are constant multiples of T” and 7”, respectively. Hence it is sufficient to show 
that T? is a monotonic function of Tj. We have 


(7.6) Qi; = Qi + ma; — Ua; — 0) + na; — Ui); — TU). 
Furthermore, we have 


mi; + ni; _ nh; — ai) 


(7.7) ee 
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Similarly 


m+n m+n 
From (7.6), (7.7) and (7.8) it follows that 


" 1 mi; + ni; ma — a) 
ne. eB Py. — Be... 


(7.8) a;-U,=4a 


nm? 
(m + n)? 


2 
Qi; = Q;+ one 


n a al ta -/ 
(m +n? (i; — w;)(a; — %;) + 
(7.9) 


(tu; — &;) (a; — 75) 


= Q:3 + a (a; — w;)(a; — a). 
mn 


Denote by \and @; — a; by h;. Then we have 
m+n 


(7.10) Qis = Qij + Mhih;. 
Denote the cofactor of Q;; in || Q:; || by R:; and the cofactor of Q;; in || Qi; || by 
Ri; . Then 


aarp (Ql, 1@sl © Ql 
; | Qi; | | Qi; + Ah; h; | | Q:; | + ATTRA; hs h; 1 ad \T? 


Furthermore, we have 


|Qis| _ |Qis — Anshs| _ | Qis| — ABBRishshs _ 
| Qi; | | Qi; | | Qi5 | 


From (7.11) and (7.12) it follows that T3 is a monotonic function of Tj. Hence 
also T” is a monotonic function of T” and, therefore, we do not change our test 
procedure by using 7” instead of T”. 

Let the sequence of pairs 





(7.12) i— 2». 


(a . X21), °** , (tin , Zew) 
be a permutation of the actually observed pairs 
(ui, Ui), *** » (Win, Uew) 


where to each permutation is ascribed the same probability (N!)~*. Then one 
obtains for 7 = 1, 2, 


(7.13) E(z; — #:) = 0 


(7.14) o(%; — %;) = N[(N — 1)mn}"* > (uy; — 0)? = cis 


N 

(7.15) E(# — %)(% — #) = NI(N — 1)mn]" a (a1; — U1) (ues — Ue) = cig. 
= 

Hence || ¢;; || is the covariance matrix of the variates 


(%—%) and (%— %). 
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Now we shall show that the limiting distribution of T”, as N > ©, is the ¢ 
distribution with 2 degrees of freedom, provided that the observation yu, 
(i = 1, 2;k = 1,---, N) satisfy some slight restrictions. Since || gi; || is the 
inverse of the covariance matrix || c;; || our statement about the limiting distri- 
bution of 7” is obviously proved if we can show that # — #’ and # — % have 
a joint normal distribution in the limit. 

Let N — © while m/n remains constant. Let the sequences Ay and Dy of 
Section IT be defined as follows: 

There are two sequences Ay , denoted respectively by Aiy and Aoy , such that 


ais = Ui; (@@ = 1,2;j7 =1,---,N). 
Also 


d; = l,-+*,@) 


: 
m 
Gj =m+1,---,N). 


Then the sequences Dy satisfy the condition W. If also the sequences Ajy 
satisfy the condition W, the distribution of #; — Z; approaches the normal 
distribution as N increases, by the theorem of Section 2. If the joint distribu- 
tion of 4; — &; and Z — % approaches a normal distribution with non-singular 
correlation matrix, the distribution of 7” approaches that of x” with two degrees 
of freedom. 

The correlation matrix of (Z, — #;) and (Z — #2) will be of rank two in the 
limit if the correlation coefficient between (Z, — #1) and (% — #2) approaches a 
limit p, where |p| <1. By (7.14) and (7.15) this is equivalent to saying that 
the absolute value of the angle between the vectors Aiy and Agy is eventually 
greater than a positive lower bound. We shall show that, if the correlation 
coefficient approaches, as N — ©, a limit p whose absolute value is less than one, 
and if Aiy and Aoy satisfy the condition W, then (#, — #1) and (# — 2) are 
jointly normally distributed in the limit. 

Let 5; and 6, be any two real numbers not both zero. Then the sequence 


* * of 
Ay = (a, “= , Gy) 
where 
* 
a; = 6101; + 6202; 


will be shown to satisfy the condition W. If either 6; or 6 is zero this is trivial; 
assume therefore that neither is zero. Without loss of generality we may assume 


N 
that >> a;; = 0, for if this were not so we could replace the original a;; by a;; = 
7=1 


a;; — N~ >> a;; as was done in Section 2. Let p’ be such that 1 > p’ > |p|. 
j 
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For N sufficiently large we have 
y(An) = N81 p ai; —2 | 61 5.  ® 1; A; | + 53 Ze a>;) 
J i 


> N(8i 2X ai; a 2p! | bdo| V(X ats)(Q as) + 53 2X a3;) 
= N(\| VX aly — || VD a, )* 


+ 2(1 — p’) | 618 | V(X ais) (do a2;)] 
and 


y2(Ax) < 2(Stue(Aiw) + d2ue(Aew)). 
Hence 


(7.16) yo(An) = O{max {u2(Arw), He(Aew)}]. 

Also u,(Ay) is a sum of constant multiples of terms of the type 
N* X a} ;a3;". 

By Schwarz’ inequality 


(7.17) N* a aj a3; < N“( aii)*(do a; _ (u2s(Aiw) Marrs (Aow))’. 
7 2 2 


The required result follows from (7.16) and (7.17). 
Since the sequences ic satisfy the condition W, the limiting distribution of 


i1(% — #1) + &2(% — #2), 


for any pair 6; , 6. not both zero, is normal. From this and a theorem of Cramér 
and Wold ({11] Theorem 1; see also [8], Theorem 31) it follows that if the joint 
distribution of (z; — 21) and (% — £2) approaches a limit, this limit must be the 
normal distribution. From a theorem of Radon ([(12]; see also Cramér [8], page 
101) it follows that if the joint distribution of (#, — 1) and (% — #2) does not ap- 
proach a limit as N — © it is possible to find two subsequences of the sequence 
(1, 2, --- , N, --- ad inf.) for each of which the joint distribution approaches a 
different limit. This contradicts the previous result. Hence the limit exists 
and is the normal distribution. This proves our statement that the limiting 
distribution of 7” is the x’ distribution with two degrees of freedom. 

The statistic T’’ seems to be appropriate for testing the null hypothesis that 
two bivariate distributions II, and Iz are identical if the alternatives are re- 
stricted to the case where Ilz differs from II, only in the mean values, i.e., the 
distribution Il; can be obtained from I, by a translation. This is no restriction 
as compared with Hotelling’s T-test since also the T-test is based on the assump- 
tion that the two normal populations differ at most in their mean values, i.e., 
the covariance matrices in the two populations are assumed to be equal. 
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ERROR CONTROL IN MATRIX CALCULATION 


F. E. SArrerRTHWAITE 


Aetna Life Insurance Company 


1. Introduction. The solutions of large sets of simultaneous equations and the 
inversion of matrices are often complicated by the fact that errors, such as those 
introduced by rounding, become magnified in the course of the calculations to 
such an extent that the results are useless. In this paper we shall show that if 
the norm of the matrix A — J is less than 0.35, operations involving the inversion 
A or the multiplication by A~ will be in a state of error control for “Doolittle” 
methods of calculation. Thus such calculations may be carried through with 
assurance that the errors in the results will be limited to two or three significant 
figures. We also point out that as soon as an approximation to A is available, 
most problems may be restated to bring them within the requirements for error 
control. Therefore the solution can be immediately completed to the desired 
degree of accuracy in one step instead of requiring multiple steps as do the 
iterative methods. 


2. The inversion of special matrices. Consider the problem of inverting the 
matrix (I + F) where J is the identity matrix and (J + F) is a non-singular sauare 
matrix. Let 


(2.1) G = (I+ F)". 


Then 

(2.2) I+ F)G=I 
or 

(2.3) G=I — FG. 


In ordinary algebra this would not be a practical formula for the calculation of G. 
However in matrix algebra the situation may be different. Examine the ex- 
panded form of G: 


(2.4) gig = 51; — Ufige;. 


The summation is over all values of k from 1 ton. Next examine the affect of 
imposing certain restrictions on F. For example, let fi; = Oifj7 > 7. This is 
equivalent to making the summation in (2.4) over the range 1 toi — 1. The 
first row of (2.4) then becomes 


gis = 93; 
and no g’s appear on the right. For the second row 


oi = 52; — forgi; - 
373 
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The only g’s on the right are those on the first row which have already been 
calculated. For the third row 
9s; = 43; — faigi; — faoge; - 


The only g’s on the right are in the first and second rows and have already been 
calculated. Similarly for the fourth and later rows. 

Thus it is seen that if F is a “pre-diagonal’”’ matrix, (2.3) is a very simple and 
practical formula for the numerical calculation of the inverse of (I + F). Also 
if F is a post-diagonal matrix, (2.3) may be used by working up from the bottom 
row. 


Similarly, if a matrix H is to be multiplied by the inverse of (J + F), let 
(2.5) G=(1+ F)"H 
and the working equation becomes 
(2.6) G =H — FG. 


The inversion of a diagonal matrix is accomplished by inverting each of its 
diagonal elements. That is if 


(2.7) F = || 6:;8: || 
then 
(2.8) F™ = || 658i" ||. 
3. The inversion of general matrices. The general inversion problem will be 
solved if a general matrix can be factored into matrices of the special types 


treated in the last section. For the moment assume that such a factorization is 
possible and let the factors of the general matrix, A, be 


(3.1) A= (i+ DST + 11) 


where R; is a prediagonal matrix, S; a diagonal matrix, and 7; a postdiagonal 
matrix. Then 


(3.2) A=S8.4+ RiSi + STi + RST). 

A slight change in form now appears desirable so let 

(3.3) A = (RS* + IDS + S"T) 
R+S+T7+RS‘SS‘T 

(3.4) R+8+T+ RST. 

For convenience let 


(3.5) B=R+8-+T 
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and remember that R, S, and 7 have no non-zero elements in common. There- 
fore the non-zero elements of R, S, and J are equal to the corresponding elements 
of B. Rearranging (3.4) gives 


(3.6) B=A-—RS"T 


and the elements of B are determined by 


Skk 
Since rx = 0 for k > 7, there is no point in making the summation beyond 
k=i-—1. Alsosince t,; = 0 for k > 7, there is no point in making the summa- 
tion beyond 7 — 1. Therefore the summation in (3.7) is to be considered to be 
over the range 1 to the smaller of i — 1 andj — 1. The7’s, s’s and ?’s on the 
right of (3.7) can now be replaced by the corresponding b’s: 
(3.8) w= ay — 5 ee, 

bx 
Since the first row (column) of b’s equal the first row (column) of a’s, the second 
row (column) of b’s is a function of only those b’s in the first row and the first 
column, etc., any calculation routine which works down from the top and from 
the left to the right will lead to a ready determination of all the b’s by (3.8). 

Thus we see that the assumed factorization (3.3) of A is always possible (unless 
some of the diagonal elements, bi, , of B are zero) and moreover the elements of 
the factors are readily calculated by the simple equations, (3.8). 

Therefore, to invert the general non-singular square matrix A, calculate the 
elements of an intermediate matrix B = R + S + T by equations (3.5) and (3.7). 
Then from (3.3) we have 


(3.9) At=(1+ S°T)* S'‘\(I+RS°)* 
which can be readily calculated by the methods of (2.3) and (2.6). 
4. The Doolittle method. The Doolittle method of matrix calculation can 


now be expressed in terms of the matrices R, S, and 7 studied above. To 
illustrate we shall use the set of equations: 


(4.1) Ay, + Ate + aist3 = CuYi + CeY2 + Crys = di, 


Gat, + Aae%2 + Aa3%3 = Cari + CooY2 + Coss = de, 
GsiX1 + Agee + Agzx¥3 = Csi + CooY2 + CasYs = ds. 
This set of equations will be represented in the form of a three element matrix, 


(4.1.a) || AX: CY: D||j. 
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The essential feature of the Doolittle method of solution is that we replace (4.1) 
by an equivalent set of equations for which the prediagonal coefficients of the 
X’s are all zero and the diagonal coefficients are all unity. Therefore consider 
the set formed as follows: 


(4.2) || AsX : C3Y : Ds|| = S“{|| AX: CY: D|| —R|| As¥ : Cs¥ : Dsl}. 

Then 

(4.3) A; = S“{A — RAs} 

or 

(4.4) (S+ R)A; =A 

or 

(4.5) A; = (S+ R)“(RS* + DSU + ST) by (3.3) 
= (I+ S"T). 


Since ST is a postdiagonal matrix, || AsX : C3Y¥ : Ds || are the required inter- 
mediate equations for a Doolittle type of solution. 
The final solution is now easily obtained. Consider 


(4.6) || AyX . Cy : D, | = || AzX : C3Y . Ds; || — (S°T) || AyX . CiY : Ds |). 
We have 
(4.7) Ay = As — (S"T)At 


or 
(4.8) (I+ S°T)Ay = As 

I+S°T by (4.5), 
Therefore A, is in fact the identity matrix and (4.6) can be rewritten 


(4.9) ||X:CiY : Da|| = || AsX : Cs¥ : Ds || — (S“T) || X : Ca¥ : Da|| 


5. The non-symmetric case. In actual practice, the work has to be so 
arranged that the elements of the matrices R, S, and (S‘T) are set out so as to 
be readily available for use as multipliers in forming the intermediate and final 
sets of equations. Table I gives such a. practical layout for the non-symmetric 
case. 

The elements of (S*7’) are set out as the postdiagonal elements of A; so that 
they do not need further attention. To determine the elements of FR and S, 
we form a set of pre-intermediate equations: 
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A — RiAs — I] 
A—R{I + S°’T) -T) by (4.5), 
=A—RS"T 
(5.2) =R+S8S+4+T. by (3.5) and (3.6). 


Therefore we see that the prediagonal coefficients of the z’s in (5.1) are the 
elements of R and that the diagonal coefficients are the elements of S. The rest 
of the coefficients in this set of equations are not needed in the calculations and 
have been indicated by dots in Table I. 


6. The symmetric case. If the A matrix is symmetric, advantage can be 
taken of the fact that the B matrix is also symmetric. Therefore 


(6.1) S+T=S8+4+R 


and the elements of S and R’ can be written down just before the division by s,; 
in the calculation of the A; matrix: 


(6.2) A; = (I+ S'T) = S“(S+ T) by (4.5) 
= S\(S+ R’). 


The layout of the work is given in Table II. 

If the Cy matrix is symmetric, the prediagonal elements of C, can be entered 
by symmetry. Therefore it is not only unnecessary to calculate the prediagonal 
elements of C,, but we can also omit the prediagonal elements of C3. Note 
that in this case C, must be calculated from the right to the left as well as from 
the bottom up. 

The most important case where it is known in advance that C, is symmetric is 
the determination of the inverse of a symmetric matrix. Then C = J and ( 
= A. Also the postdiagonal elements of C; are all zero so that the only 
elements of C3; which have to be calculated are the diagonal elements. These 
are the reciprocals of the s;;’s. 

A case where C, is symmetric though C # J will appear in a subsequent paper. 


7. Norms. In order to state the conditions for error control in a matrix 
calculation, a concept of the nerm or the absolute value of a matrix is necessary. 
In this paper the norm will be defined as the square root of the sum of the squares 
of the elements of the matrix. That is 


7.1) N(F) = V22i(fi)?. 
The two basic inequalities satisfied by the norm are 


(7.2) N(F + G) < N(F) + N@) 
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and 
(7.3) N(FG) < N(F)N(GQ). 


All other properties of the norm are derived from these. 
For future reference we list the following norm relations: 


(7.4) N((F)Z + @] = N(F + FG) 
< N(F) + N(F)N@) 
< N(F)ll + N@)). 
If N(F) < 1 we have 
(7.5) Nil + Py* —- = N= FP +P ~~. - 2 
< N(F) + [N(F)P + IN(F)P + --- 


N(F) 
1 — N(F)° 


If N(G — I) < 1, (7.5) becomes 


o* tid 


When N(F — I) <1 
(7.7) N(F'G) < NII + (F — Dj{Gl 
< [i+ N(F* — DING) 


N(G) 
i-NFOD by (7.6) 

8. Error matrix and error norm. We shall also need a formal statement as to 
what we mean by error and we need a measure of the errors. 

By an error matrix we mean the matrix whose elements consist of the differ- 
ences between the value of the matrix elements as actually calculated and the 
true value of the matrix elements which would have been obtained if all calcula- 
tions had been made exactly without any rounding or other approximations. 
The fundamental relation for the error matrix, E[f{(@)], of a function, f(G), of G 
is 


(8.1) Elf(@)] = fl + E@)] — f@. 


If each element of a matrix is calculated to qg decimal places and the matrix 
has p rows and p columns, the maximum rounding error introduced in any 
element is .5 X 10%. The norm of the error introduced by rounding is less than 


(8.2) NE, = Vp*[.5 X 10-¢P 
5X 10° X p. 





380 F. E. SATTERTHWAITE 


For triangular matrices 
(8.3) NE, = 5 X 10 *V/p(p + 1)/2. 
For one column matrices 
(8.4) NE; = 5 X 10 */p. 
For future reference the following formulas for error norms are listed: 
(8.5) NE(F + G) < NE(F) + NEG). 
(8.6) NE(FG) < NE(F)N|G + E(G)] + NE(G)N(F) 
< NE(F)N(G) + NE(G)N(F) + NE(F)NE(G). 
(8.7) NE[FU + G)] < NE(F)L + NIG + E@))| + NEG@N(F) 
< NE(F) + NE(F)N(G) + NE(@)N(F) 


+ NE(F)NE@G), 
If N(F — 1) + NE(F) <1, 


(8.8) NE(F") = NEI + (F - DI" 


-¥( oso Teen 
~~ L+(F+ ke) -T 1+F-2D 
= N{I+ [F+ E(F) — DD} {1+ (F-D} {EP} 


NE(F) 
Sf — WF + E® — Ml —- Ne -D} 


“ NE(F) 
fi -Nf —D — NEM — NF Di 


If N(F — I) + NE(F) <1, 


by (7.7), 





(8.9) NE(F7G) = n| G + E@) G 


I+(F+E®)-T I1+F-D 





_ Nl [AG@))U + (F — D) — (EPG! | 
(i+ [F + EF) -Djytl+ &-D} 
< NE@) + {INE(F)I/0 — NF — DUIN@} 
1— NF + EF) — 1] 


by (7.7). 
9. Certain maxima. The R, S, and 7’ matrices have no non-zero elements 

in common so that 

(9.1) N(B -—I) = N(R+S8S -—-I1I+4+T) 


= V(N(R)P + [NS — DP + [N(7)P. 
Similarly 


(9.2) NE(B) = V[NE(R)? + [NE(S)P + [NE(T)P. 
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The developments of the following formulas for certain maxima are not given 
here since they involve only well known calculus principles. It is understood 
that these inequalities hold whenever the quantities involved exist. Usually the 
maxima are given subject to the condition N(7T) = N(R) as well as for the 
unrestricted case where N(7') may be zero. 


(9.3) max ) as =1—V71— [M(B — DP. 


N(R) N(B-D 
—NS-D” 1 We= DF 
N(B- 1) 
~ V2V1— (NG = DF 
(9.6) max [N(R)NE(T) + N(T)NE(R)] = V[N(B — DF — [NS — DP 


. 


Xx V[NE(B)P — [NE(S)P. 





(9.4) max T if N(T) = 0, 


(9.5) if N(T) = N(R). 


Any substitution satisfying the relation 


NE(R) _ N(T) 


~) NE(T) ~ N(R) 


will cause (9.6) to attain its maximum. 
(9.8) max[NE(R) + kNE(S)] = Vi+NE(B) if NE(R) = NE(T), 
(9.8.2) =VY1+RNE(B) if NE(T) = 0. 


k+N(R) _ [V(B — DP 

(9.9) max i-NS—-D = k + —— 

where K is the root of 

(9.10) [1+ WK? + 2{k{N(B — 1)}"]K + {[N(B — DI‘ — [N(B — DP} =0. 


10. Errors in the Doolittle method. In all that follows, we shall assume that 
N(A — I) is small enough so that the “divisions” are permissible. First let us 
examine in the multiplies B= R+ S+T. By (8.6) 


(10.1) B-I= G4 ~ 1) -2e'F. 


Therefore 


NB-N<MaA-D+ ARO. 


(10.2) <N(A-D+1-WV1i—[NB— DP by (9.3) 


by (7.3) and (7.7), 





382 F. E. SATTERTHWAITE 


Remembering that A has no error and letting NE, be the rounding error norm 
introduced in writing down the elements, we have 


NE(R)N(T) — NE(T)N(R) + NE(R)NE(T) 
1— NiS+ EO) —-T 


NE(S)N(T)N(R) 
* Ti — MS + BS) — M1 —NGS—D} 





(10.3) NE(B) < NE, + 





by (8.9) and (8.6). 


We are interested only in the range of values where the errors are small. There 
fore we shall ignore second order errors. Except for such errors, 


_ N(R) 

—-NS-TD 

N(T)N(R) 
[i — NMS — IP 


The last term will be largest when N(7T) = N(R). Therefore the sum of the 
second and third will be largest when NE(R) = NE(T) by (9.7). 


N(R) 
—N(S—TD) 


(10.4) NE(B) < NE, + _N(T) ER) +7 


Wa ST NE(T) 


+ NE(S). 


NE(R) + Fe Sot | NE(S). 


(10.5) NE(B) < NE, + 2 — 


By (9.8) we now obtain 


(10.6) NE(B) < NE, + Kx\/2 + K? NE(B) 
ie eee 
=1-KV2+ FE 


- N(R) — NB-D 
K = max i-we-5b* V2 Vi - We — DE by (9.5). 


Actually in practice we introduce the rounding error in (S ‘7’) instead of in T 
as assumed above. Our assumption is conservative since the division by S 
magnifies any error in 7’. 

Next consider the errors in the C3; (or D3) matrix. From (4.2) 


(10.7) C; = S*(C — RC3) 
‘= SC — S*RC, 
= (I+ S’°R)'S’C by (2.5) and (2.6), 
= [SI + S“R)JC 
= (S+ R)'C. 
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Therefore 
N(C) 

lee 

(10.9) N(C3) < i-We+e-T 
N(C) 

i < 5 

(10.10) Si-WR -NS-D 


From (10.7), remembering that C has no error, and letting the rounding error be 
NE; ’ 


NE(R)N(C3) + NE(C;)NIR + E(R)] 
eo mse ee 
___ NE(S)IN(C) + N(R)N(C)] 
{1 — N[S + E(S) — I}{1 — NS — D} 
NE(R) + NE(S) 
-i-N@®)-NG-D N(C) + rant — NIS + E(S) -D} 

™_ 1 — N(R + E(R)] — NS + ES) — I] 








+ 





by (7.7) and (7.3), 








N(C) + N(R)N (C3) 
1—-N@-D 


N(C) N(R) 
[} + me ean] Py cos, 


1—-NS-—D 
<< -s— M(CUL- NS — DI 
~ [1 — N(S — Dil — N(R) — NS — 0) 


and since transferring the terms in NE(C3) to the left requires that we divide 
through by 


(10.13) 1- N+ #®)) __1—- NR + E(R)] — NIS+ E(S) ~ 
' 1— NS + ES) + 1 1— NIS+ HS) — 1] 
Again we can ignore second order errors. Taking maxima by (9.8.a) gives 


V2 NE(B)N(C) 








+ NEs 
(10.14) 





1—-V2N(B-T) 
and 


NE(C;) V/2 NE(B) [N Es]/[N(C)] 
(10.15) ——— < — > ~ : 
NC) ~f1—-VY2N(B-DP 1-V2N(B-]D) 
Thus we see that the proportionate error in C3 is made up of two parts: the first 


due to the rounding errors in the multipliers as given by the first term and the 
second due to the proportionate rounding error introduced in calculating C;. 
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Finally we have the errors in the C, (or D4) matrix. Since 
(10.16) C,= AC 
we have 


(10.17) NG) < yaya: 
By (4.6) 
(10.18) C. = Cy — (S? TYG. 
If we let NE, be the rounding error introduced in this step 
(10.19) NE(C) < NE(C:) + NE + eee ei ee 
NE(S)N(T)N(Ci) 
hi — NS + E(S) — I}{1 — NS — 0} 
{NE(C3) + NE,} {1 — N[S + E(S) — J} 
+ [yen + al Ls NG =I q 
“T— NT + KD] — MS + ES) — I 





by (10.17) and relations similar to (10.13). We now ignore second order errors 
and take maxima by (9.4) and (9.8.a): 


NE(C:) , (NEs) , V1 + K? NE(B) 
(10.20) NEC) < NC) NO) _1-NA-D 
NC) ~ 1 — V2 N(B — I) 

















1 V2 
arate (1—+/2 N(B— a 7 
_ [NE3]/[N(C)] NE 
1—-V/2N(B-D* NC) 
1— VW2N(B-— 1) 


where 


ae eg 4 
(10.21) K = max i-NG—H ~ Vizweate by (9.4), 
and 

(10.22) V1 + K? = 1/71 — [N(B — DFP. 


If A is symmetric, NE(B — I) remains unchanged but NE(C,) can be some 
what strengthened through the use of (9.5) and (9.8) instead of (9.4) and (9.8.a). 





MATRIX CALCULATIONS 


The result is 


NE(C,) 1 
(10.23) VO. < [ 


[1 — N(A — DIV2V1 — [N(B — DP 


V/3/2 Il NE(B) ] 
(1 — 3/2 N(B — NPILI — 73/2 N(B — 1 


+ coil NE; 
(1 — 3/2 N(B — 1)|* NC) 
1 NE, 


+ 74/3/72 N(B- D N(C)° 


If C is approximately equal to J, a better formula is obtained by substituting 
(I+ C,) for C. The final formulas then are identical to (10.20) and (10.23) with 
the substitution of 1 + N(C;) for N(C). Similarly if C = J so that C, = A’, 
N(C) = 1 should be substituted in (10.20) and (10.23). 

If A is symmetric and if C = J so that C, = A’, the prediagonal elements of 
C, are filled in by symmetry as in Table II instead of being calculated directly. 
This complicates the analysis of error relations. The following inequality gives 
the error limit for the diagonal and postdiagonal elements of (A~’). We have 
indicated these elements by F. 


VBAl—- NAD) FING = DK 
, E(F eet acca tienen tele ta eicemineeeae B 
po wre a - e- eo a 
NE; 
1-—-/2N(B-TD 


where K is the root of (9.10) when k = 1 — N(A — I) and NE; = 0.5 X 10°* X 
Vp(p + 1)/2 by (8.3). 


> 


11. Results. Given the matrix A, we subtract one from each element on the 
principal diagonal to obtain the matrix (A — J). By the norm of (A — J) we 
mean the square root of the sum of the squares of the elements of (A — J). We 
shall now show that a Doolittle process such as outlined in Table I is in a state of 
error control if the norm of (A — [) is less than 0.35: 

1, N(A — I) < 0.35 

2. N(B — I) < 0.4642 by (10.2). 

3. NE(B) < 1.09 p by (10.6) and (8.2) 
if the maximum rounding error in any element is 0.5 and A has prows. Thus no 
element of the multiplying matrices R, S, or JT can have an error of greater than 
this amount. 

4. NE(C,) < (44 X p X 10 *)N(C) + ( X p X 10°’) by (10.20) and (8.2) 
where g decimal places are carried on the left and r on the right. Thus if the 
decimal point in C is shifted so that N(C) < 1, the error in any element of C, can 
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not amount to more than three significant figures if the same number of decimal] 
places are carried on both the left and on the right (four significant figures for 21 
to 200 rows). 

5. NE(A7*) < three significant figures by substituting N(C) = 1 since C = J. 

As we let N(A — I) become larger than 0.35, the maximum errors indicated 
by our formulas rapidly become very large. In fact they become infinite if 
N(A — I) = 0.414. 

Since for more than four equations the above formulas show errors in the 
second decimal place no matter how small N(A — [) is, it is suggested that as a 
general practice: 

1. The problem be arranged so that N(A — J) < 0.35. 

2. The decimal points in C be shifted so N(C) < 1. 

3. Three extra decimal places be carried in the calculations. 


12. Preliminary adjustments. The requirement that N(A — I) should be 
less than 0.35 is not normally met in practical problems. If, however, an 
approximation to A” is available, the problem can almost always be rearranged 
to satisfy this condition. 

Thus if we are solving the equations such as given in Table I, 


(12.1) AX = CY =D, 


we are perfectly free to multiply through by any non-singular matrix F without 
disturbing the solution: 


(12.2) (FA)X = (FC)Y = (FD). 


Now if F is a sufficiently close approximation to A’, FA will be almost equal to I. 
Therefore N(FA — I) will be less than 0.35 and a Doolittle solution of (12.2) will 
be in a state of error control. 

Similarly for the inversion of A, we can apply the Doolittle process to the pair 
of matrices 


just as easily as to the pair 


since 


(FA)*||FA:F || =A'F" || FA: F|| 
AT(FUF)A : AT(F“F) || 
= ||A“A:A™|| 
| 2: A |]. 


. ° e . al ° 
Thus by taking F as a sufficiently close approximation to A, we can bring an 
inversion calculation into a state of error control. 
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The computor should be cautioned that the multiplication by F must be exact 


and that no rounding is allowable in this step. Our formulas assumed that we 
started with matrices free of error. 


13. Further work. The principles used in this paper can be applied to the 
task of developing calculation routines which will be in a state of error control 
regardless of the size of N(A — J). Enough work has been done to see that such 
routines do exist and do not involve prohibitive labor. The author expects that 
the most efficient routine will be to use these more elaborate methods to obtain 


an F such that N(FA — J) < 0.35 and then to use the normal Doolittle methods 
as outlined in section 12. 
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INVERSE TABLES OF PROBABILITIES OF ERRORS OF THE 
SECOND KIND 


By Emma LEHMER 


University of California, Berkeley 


1. Introduction. The problem of testing linear hypotheses was discussed by 
Kolodziejezyk [1], and later in greater detail by Tang [2], who computed a table 
giving the probabilities of errors of the second kind Py for a range of values of 
the two degrees of freedom f; and fo , (f1 = 1(1)8, fo = 2, 4, 6(1)30, 60, ~)* and 
for two fixed levels P; = .01 and .05 of the probability of errors of the first kind, 
These tables are in terms of a parameter yg, (g = 1(.5)3(1)8) whose statistical 
significance, or rather that of 


A= (fi + 1)¢'/2 


is discussed in Tang’s paper.: A restatement of the problem of testing linear 
hypotheses in a more canonical form, giving an interpretation of \, will also be 
found in a recent paper by Wald [3]. 

Professor Neyman has felt for some time that a table giving ¢ = ¢(a, b, a, 8) 
as a function of the two degrees of freedom f; = 2a, and fe = 2b, and of the two 
probability levels a = P; and 8 = 1 — Py would be more useful for statistical 
purposes, where £ is the probability of detecting the falsehood of the hypothesis 
tested. A paper by Professor Neyman explaining this point of view and giving 
applications of the present tables to some statistical problems will appear 
shortly. These tables were computed in the Statistical Laboratory of the Uni- 
versity of California,’ and give values of ¢ for the following range of parameters: 


(a, B) = (.01, .7), (.01, .8), (.05, .7), (.05, .8) 
fi: = 1(1)10, 12, 15, 20, 24, 30, 40, 60, 80, 120, ~. 
fo = 2(2)20, 24, 30, 40, 60, 80, 120, 240, «. 


2. Analytic definitions. The statistical parameter 
(1) \ = (a, b, a, 8) = (a + 4) oC, b, a, 8) 


can be thought of as an inyerse function connected with the hypergeometric 
distribution. Inverse functions y(a), u(a, a) and x(a, b, a) of the better known 
normal, Gamma and Beta distributions respectively have all been tabulated, 


1 The notation m = r(s)t is equivalent tom = r,r+s,r+2s,... ,t. 

2 These tables were begun by Miss Leone Gintzler, and were carried on by Mark Eudy 
under a University of California Research Grant. The bulk of the computing was done, 
however, by the author and by Mrs. Julia Robinson under a grant of the American Philo- 
sophical Society. 
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and are sometimes called “percentage points” of the distribution. To begin 
with the simplest, the normal distribution, we may define y(a) as the solution of 


1 y(a) ~stp 
(2) val. e* "dr = a.* 


This function has been recently tabulated by Truman Lee Kelley [4] for a = 
p = 0(.0001) 1 to 8 decimal places. 
The function u(a, a) is the solution of 


1 u(a,a) 
(3) < I 6 dt = Ty(a)/T(a) = a. 


This is connected with the well known percentage points of the x? distribution 
with v = 2a degrees of freedom as follows: 


(4) x(a) = 2u(a, 1 — a). 


Catherine Thomson [5] has tabulated x3(a) for a = .005, .01, .025, .05, .1, 
25, .5, .75, .9, .95, .975, .99, .995, and for » = 1(1)30(10)100. She has also 
tabulated [6] the corresponding parameter x = x(a, b, a) of the Beta distribution 
with », = 2b, ve = 2a degrees of freedom defined by 


1 prtedian B.(a, b) 


(5) Bab, &§ G-Od= FCs) 


fora = .005, .01, .025, .05, .1, .25 and .5 and », = 1(1)10, 12, 15, 20, 24, 30, 
40, 60, 120, ve = 1(1)30, 40, 60, 120, © to five significant places. 
Similarly \(a, b, a, 8) can be defined as the solution of 


1 z(a,b,1—a) 


6  3BG@,d) eA) — 4) F(—b, a, —M) dt = 1-8 
’ 0 


where 


sie Y “tbs, ... 
P,4,9) = 1+ 54+ Ga * + 


is the confluent hypergeometric function. 


3. Limiting cases. It is well known that as a tends to infinity 
x; (a) u(a, 1 — a) y(a) (2) 
-— SS Ci 1  -_ = is 
@ ; a + Vata 
There are many approximations [7] to x’. In a recent paper Peiser [8] gave a 


rigorous derivation of an asymptotic formula for x’. 
Similarly, the limiting cases of x(a, b, a) as a and b tend to infinity are known 
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[9] and follow readily from (5), although no attempt has been made, as far as | 
know, to find better approximations. These limiting values are as follows: 


(8) lim a[l — x(a, b, a)] = u(b, 1 — a) 


ao 


(9) lim bz(a, b,a) = u(a,a). 
b—00 


The two corresponding limiting cases for \ are not at first glance so symmetric, 
When a tends to infinity, we have from (1) 


(10) a 
amo A 
while 


(11) lim F(—b, a, —ag’t) = (1+ ¢’t)’. 


a->o 


Substituting these in (6) and letting ¢ = 1 — z/a(1 + ¢’) and passing to the limit 
we get with the help of (8) 


1 oo 


12 = 
( ) T'(b) (1492) u(b,@) 


e 72’ 'dz=1-—8. 


In other words 
(1 + ¢°)u(b, a) = u(b, B) 


or 
” —. oe 
(13) etal 4/28 L. 


This is the only case, except for b = 1 in which ¢ can be given explicitly. 
For b = 1, we have from (5) 


x(a, 1,a) = A/ 


and (6) can be easily integrated to give 


g(a, 1, a, 8) = E (=) /(c + saa _ Vi=a]. 


In all other cases it was found impractical to attempt an inversion of (6) to get ¢. 
When b becomes infinite (6) becomes with t = z/b, and with the help of (9), 


. u(a,l—a) 2 (a—1) /2 a2 
(14) e I " (=) Ig (2\/dz) dz = 1-8 


where 


er 


(15) es(2VX2) = “A lim F(-2, " -™) 
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is the Bessel function of a purely imaginary argument, which is usually defined by 


Gy" 
_*< 2 
In(z) = 2 = oe ee | 


Expression (14) was also obtained for this limiting case of the hypergeometric 
distribution by Wishart [10]. This integral, however, does not give \ any more 
explicitly than the general integral (6), and since the calculation of \ increases in 
difficulty as a increaseX an attempt was made to derive an approximate formula 
for g(a, ©, a, 8) for large a. To this end an asymptotic formule [11] was de- 
veloped for J,(na), the principal term of which is 


a+ =" ( fi. +e#—-1 ay 
Substituting this into (14) with x = 2 -~W)dz/a — 1 and n = a — 1 we get for 
large a 


Mum 
a—l1 —t(l—¢*+to a. on 1 _— 
(17) 7 I te dt B 


If we assume that ¢ is sufficiently small to neglect the term in ¢* we get as a 
first approximation 


(1—¢?) u(a,1—a) esi 
(18) maf edt =1-—8 


or 


“ - _ u(a, 1 — 8B) 
(19) g(a, ~, a, B) mo 


a formula very similar to (11). In fact since a is large this formula can be 
reduced one more step using (7). This gives 


(20) lim ~/ay'(a, ~, a, 8) = y(a) — y(8). 


Similarly (13) becomes 
lim Vag'(%, b, a, 8) = y(a) — y(8). 
If instead of neglecting the term in ¢*, we multiply: it by the value of ¢ at its 
upper limit, we get 


2 1- V1 — Aula, 1 — a) — u(a, 1 — A/a 
(21) ? u(a, 1 — a)/a 








Professor Neyman derived another approximation for g(a, ©, a, 8) by assuming 
that the distribution (14) approaches a normal distribution for large a. He 
obtained: 


"8 2yia) | ¥*)\"" 
(2) oe — yay (1 + We + ERY” 
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Both (21) and (22) obviously reduce to (20) in the limit. The following table 
shows the efficiency of these formulas for a = 60 


Table (21) (22) (20) 
. =, 687 695 668 640 
B=. 


647 
.603 


= .05 057 540 044 529 
= 7 


A rigorous derivation of some such formula giving the actual order of approxima- 
tion of g would of course be of interest, but is likely to be quite complicated. 


4. Calculation of tables. It is fairly obvious that the integral (6), although 
very useful theoretically is not well adapted to actual calculations. It can 
easily be integrated by parts to produce the infinite series. 


aS Bisa +7,b) +» Sd‘ B.(b,a + 1) 
(23) . <1 i! Bia +i,b) = i! Ba + i, b) 


This series can be used effectively for calculation purposes only if \ is compara- 
tively small. If b is an integer, however, this series can be replaced by a finite 
series of 6 terms, which was also used by Tang [2] in calculating his tables. 
This series is as follows with x = x(a, b, 1 — a) = 2(b, a, a): 


= 1-68. 


b—1 
(24) eo“ — 2)" DT = 1-8 
t==0 


where 

T=1, Ti = 2fAd — 2) +a+b-— 1)/01 — 2) 
and’ 
(25) T, = «{fA0 — 2) +a+b — nj Toi + AtTy-2}/n(1 — 2). 


The subjoined tables can be thought of as inverses of Tang’s tables, and could 
have been obtained from tables such as Tang’s by inverse interpolation, had the 
interval of tabulation been sufficiently fine. The interval of tabulation of .5 for 
g allowed only a crude approximation or trial value of y, the corresponding 
probability was then calculated for this point, and then corrected with the help 


3 It will be noticed in comparing these formulas with those given by Tang, that z is used 
for1 — z. This is done to conform with Miss Thomson’s table for z. 


4 
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of derivatives. In the beginning of the work, a recalculation was usually 
made for the corrected value of g, and the tabulated value of g was then obtained 
by inverse interpolation between these very close values. As the work pro- 
gressed, and difference tables were calculated in several directions, the guesses 
improved considerably so that a correction could be made using the first deriva- 
tive to give a tabulated value of ¢ correct to three decimal places. Such corrections 
never exceeded .004, and therefore it is hoped that the tables are correct to the 
last place. The derivative in question is given by 


(26) 2 = (20 + Iya(l — 2) 7, , 


and was obtained as a by-product of the calculation of (24). This method was 
used for all finite a’s and for all b’s less than 30. For b = 30 or more it was 
found more expeditious to use the infinite series (23), about 20 terms of which 
sufficed. 

The values of x(a, b, 1 — a) = x(b, a, a) used in these calculations were ob- 
tained to five significant places from Miss Thomson’s table [6] for a = .01 and 
05 with »; = 2a, ve = 2b. 

No calculations were made for non-integer b’s since for small values of b, 
\ was too large to make the infinite series (23) practicable, while for b = 7, ¢ can 
be easily obtained by interpolation. The only available method for calculating ¢ 
for b = 1/2, 3/2 or 5/2 would be by numerical integration of (6), which would be 
rather lengthy. Furthermore, the interest seems to be in large rather than small 
values of a and b. 


5. Calculations for infinite cases. The case a = © was readily disposed of 
using (13), as for b = o, the integral (14) was again integrated by parts to give 


en 00 u (a+v) /2 a 

(27) oo (*) Top(2/ur) = 1 — B. 
v=0 

This was found to be effective, especially when u < \, which is the case for small 

values of a. When u exceeded \, the complementary series was used, namely 


iil co u (a—1—») /2 — 
(28) oe (*) Ta-1-o(2\/uX) = B. 


The calculations proceeded in much the same manner as in the finite case. The 
values of 2u were obtained from Miss Thomson’s table of percentage points of 
x; (a) distribution with » = 2a degrees of freedom, while the values of J) and J; 
were obtained from the tables of Bessel Functions [12] for 2+/rz < 20, and from 
the tables of Anding [13] for larger values of the variable. The values of J, for 
v > 1, were computed from the recurrence relation* 


cosh z 


‘ If ais an odd integer, the values of J,(z) can be built up using (29) from J_4(z) = 


inh 
and J;(z) = a , which of course are tabled. 
Wz 
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(29) : Ty41 — -% I,(z) + I,-1(2). 


Future calculations of this sort could make use of the forthcoming tables [14] of 
I,(z) for v < 20, and z < 20 to ten significant places. As before, the tabulated 
values were obtained by correcting trial values of g by means of the derivative 


(a—1) /2 
(30) - = +/2u(2a + le °™ (*) I.(2+/ ur), 


which again was obtained as a by-product of the calculations. This method 
becomes impractical when a is too large, because a great deal of accuracy is lost 
in applying recurrence (29) many times. For some of the larger values of a it 
was found preferable to use the series 


ren ru(a + 2) 

= i! Tia + 4) 
although it converges rather slowly. In other words the upper limits for a.and b 
were pushed as far as was practicable. 

The values which are tabled to two instead of three decimals were interpolate 
using second differences, all other values were computed in the manner described 
above. Difference tables were made by rows, columns and between tables, as a 
final check on the work. Difference tables using harmonic interpolation were 
also made for both rows and columns, and found very effective, with the exception 
of the lower right hand corner, where ¢ drops rapidly to zero. The last column 
of each table is to be used for harmonic interpolation. 


(31) 1-—-B=e 
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SOME COMBINATORIAL FORMULAS WITH APPLICATIONS TO 
PROBABLE VALUES OF A POLYNOMIAL-PRODUCT 
AND TO DIFFERENCES OF ZERO 


By L. C. Hsu 


National Southwest Associated University, Kunming, China 


1. The main purpose of the paper is to establish some combinatorial formulas 
concerning the mathematical expectations or probable values of a product of 
n given polynomials. The problem may be stated more definitely as follows: 

Let %1, °°: , X, be m non-negative discontinuous variables for which we have 
assumed that the probability that each x takes a possible value is equally likely, 
and let fi(z), --- , fx(x) be nm given polynomials. Then we shall ask: What 
is the probable value of the product fi(x:) --- fx(an), provided the sum of the 
variables 21, --- , 2%, is known? More generally, we may consider the problem 
with certain restrictions to x such asa < 24; <b, (@ =1---n). 

By a limiting process’ it will be found that all the formulas established for 
the preceding problem can be extended to the case of continuous variables. 
On this account, it is important to find explicit formulas for the problem merely 
involving discontinuous variables. 

By the definition’? of MacMahon, we say that a set of numbers (21 --- Zn) 
is over all different compositions of m into n parts with each x > k, if (a «++ 2p) 
runs over all different integer solutions of the linear equation 7; + --- 2%, = m 
with each x > k. We shall use the notation (m; k; x1 --- %,), or simply (m; 
k; x), to denote that a set of numbers (z; - -- z,) is over all different compositions 
of m into n parts with each x > k. 

The notation E(m; 5; [fi(x)] - - - [fn(x)]) will be used to denote the mathematical 
expectation of the product fi(21) --- fn(an) in which the sum of n variable quanti- 
ties 41, +--+ , X, is known, namely x; + --- + 2, = m, and each quantity is a 
multiple of 6 and m is of course a multiple of 6. Thus by the definition’ of 
mathematical expectations we have 
(1) E(m; 8; (fil +++ (fl) = ( D1 Dr falas) «++ fa(tnd), 

(m/ 5; 1; z) (m/6; 1; z) 
where the summation on the right-hand side runs over all different compositions 
of m/é5 into n parts with each x > 1, and the given constant 6 is called a “varying 
unit”, that is the least possible difference between two unequal quantities in 


(v1 ---2,). If the varying unit approaches zero, (1 --- Xn) will become a set 
of continuous variables. 





1 The limiting process will be illustrated by the proof of corollary 2 of theorem 1 in this 
paper. 

2? MacMahon, Combinatory Analysis, Vol. 1, p. 150. 

3 See for example W. Burnside, Theory of Probability, Chap. 4, 13. 
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If fi(z) = --- = f,(x), we may write 
E(m; 6; [f]") instead of E(m; 6; [fil --- (fal). 


A well-known convention for (™) is also adopted here: 


m! . 
(") - je-or = *ses" 
n 


0 otherwise. 


2. Lemmas. In order to obtain explicit formulas desired we first establish 
four lemmas as follows: 
Lemma 1. Let m, 11, --- , Tn be non-negative integers. Then 


1 (t\ _ m+n-—-1 
(2) Hae 3a) 


Proor. Construct a generating function: 


1 ry+1 1 Tat1 
(4) (4) : la| <1. 


It is observed that the coefficient of the term 2” “'***' +” in the expansion of the 
above product is given by 


- es). Tn + Xn\ “ M1\ | (%n 
(m—r4—+ + -—rp3 0; z) I Ln (m;0;2) \11 fa, 


On the other hand we see that the coefficient of the term 2” “!*"***™") in the 


ry +---+1ry_tn 
expansion of ) is given by 


1-—z 


a 


m— (i+ --> +f.) In+n—1 


Hence the lemma. 
Lemma 2. Let a, b,c, --+ be arbitrary constants, and k; , ke , kz , -- + be positive 
integers. Then 


a % Ly Le 
Se We(e) + (6) + oe) +) 
= n! Zz (, mtn — 1 San 
" (niaby-++) \aki + Bke + yks + --- +n — lJ al Bly! 


Proor. Expanding the left-hand side of (3), we see that the coefficient of the 
term a*b’c" --- is equal to 


at = (#)--(2\() (Ce (rr)--e) 
al Bly!. °° (m;0;z) ky ky ke ko ks ks 


> 


(3) 
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By lemma | it is reduced to 


n! ( mt+n—1 ) 
alBly!--- \aki + Bho + ykg+ ---+n—1)° 
Substituting, we get the lemma. 
From now on, we shall frequently write f~ instead of f(x), so that 


fg” + rw ooo ("10 = (f+9). 


LemMa’3. Let m,n (< m) be two positive integers. Then for any given poly- 
nomial f(x) of the kth degree we have 


a 7 m+tn—-1\P((f— 1)” 
(4) Xp Sz) +++ flan) = nt Aen FA +n- :) IT p»! 


> 
where 


f° =f), Sp) =1l-prt-+++ kp. 


Proor. Since f(x) is a polynomial of the kth degree, there exist (& + 1) 
values 6; , --* , Bo Such that 


a(7) fee + a(7) + Bo = f(z). 


By putting x = 0, 1, --- , k, we find successively* 


(6) B® =f” — or forte (ys = (f — 1)”, 


(v = 0,1,--:,k). 


The lemma is thus obtained by means of (3). 
For convenience, we denote the summation 


, Silas) -++ fa(tn) by S(m, [fil --+ [fal). 


Thus formula (4) can be rewritten: 
» +n-1 Bo° Br" , 
/ SS s on ! m )e ee a a 
(4) (m, [f]") =n SiR caiai po! pi! Dp» ! 


where Bo , 1, °*+ , Bx are given by (5). 
Lemna 4. Let fi(x), --+ , fn(x) be n given polynomials, Then 


6 Smifl--fd=-t. ¥ (HD SC Uf +o + Sal)» 


n! (pyres Ee(Leeen 


‘Strictly speaking, the relation (5) is established and proved by induction on », in 
which a well known combinatorial equality is applied. 
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where the summation on the right-hand side runs over all different combinations 
out of (1---n), (kK = 1---n). 

For example, if n = 3, then all the different combinations out of (123) will 
be: (1), (2), (8), (12), (13), (23), (128). 

Proor. Consider a typical term 


ta S(m, [fr,]” ree [f>,]**), 


where] <¢< n,qa.+--: +q:=n. Nowa necessary and sufficient condition 
that the term will be contained in the expansion of S(m, [fu, + --- + fy,]") is 
(vy, ++ vee(ui --- we), ie. (v1 --- v2) is a combination out of (wu --~- ux), and for 


a fixed k, there are . os -different combinations of (4: --- wu.) satisfying the 


condition (vy; --- v,)e(ui --- wee(1 --- 7). Therefore the number of occurrences 
of the term in the right-hand side of (6) is given by 


¥ (-n" - ‘ =(1-)™'= ° if 


v=0 Vv if 


The term vanishes generally except gq. = --- = q: = 1. Hence the right-hand 
side gives S(m, [fi] --- [fnl). 


3. Theorems and corollaries. In the following statement of theorems and 
corollaries the notation (x; --- z,) will be always used to denote a set of unde- 
termined quantities as specified. 

THEOREM 1. Suppose that (a --- xn) is a set of natural numbers in which only 
the sum of the numbers is known, namely x1 + --- + 22 = m. Then, for any 
given polynomial f(x) of the kth degree, the mathematical expectation of f(x1) --- (an) 
1s given by 


E(m, 1, [f]”) 
PP gy n! E * iy ( m+n—1 yg — 1)] ..- (if — 1)]* 
“\n-1 (nop) \S(p) +n — 1 po! +++ pe! . 
Proor. Let m’ = m-+nr. By lemma 1 we have 


2, Sn\ _ _ _ fm’ -ar+n-1 
5) G)- 31> 3 1-( n-1 ) 


This is the number of compositians of m’ into n parts with each part > r. In 
particular, if r = 1, we find that the number of compositions of m into n parts is 


e = : ) Thus by (1), the required value is equal to 


1 


{S(m, [1)}*S(m, [f1"), ice. E: 


n—l1 


) S(m, [f\"). 


The theorem is therefore proved by lemma 3. 
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Corotuary 1. Let (a1 --- xn) be a set of positive quantities, of which the 
varying unit is 5, and the sum is m. Then, for any given polynomial f(x) of the 
kth degree, we have 


E(m, 5, fl") 


6 -»(G)- 





n-—-1 


) x () ae ) (g — NP ---  — 


(730; p) S(p) +n-—1 Po! ee px! 
where 
g(x) = f(zs), = =9Sp) =mt+--+ +h. 


Proor. It follows immediately by the relation: 


Bim, 6, iter") = # ((™), 1, yey"). 


CoroLuARy 2. Let (2 --- Xn) be a set of non-negatively real numbers, of which 
the sum is known, namely x, + --- + 2, = m, m being a known real number. 
Then, for any given polynomial f(x) = ao + --- + ay", (ax ~ 0) we have 

E(m, 0, [f]") 
(9) nt)” > re (Olac)"° = (kYaz)%* 
¥ nn (n;0;p) (1-q + +++ + kqx +s- 1)! qo! gx! ; 

Proor. The proof of the corollary depends essentially on the concept 
that two unequal real numbers may differ by an arbitrary small number h. 


Let h be an arbitrary positive number and write f(xh)/h* = g(x, h), where 
the number k is the degree of f(z). Then, since 


(0 
Bc Ceo 
2 


we may write 





a,b) — (*) 9 — 1,0) + ++ 1 (?) 90,8) = Hota, + HR), 


where 


lim R,(h) = {( 
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Now we pass to the limit h > . in which we assume that h runs through a sequence 


of real numbers of the form ™” , N being a natural number. 


WN? Thus by corollary ] 


we have 


lim E(m, h, [f]”) 
h-0 


nl Tim z,(G)+9- ‘( 2 ys jo Tp lee + he RCD 


S(p) +n —-1/\ n-1 ei 


ni >) 4lim (Ci) +» sre 01()— =) wh 
(nip) | ho (S(p) +n — »1((”)- S(p »)((™) - i) 


{tim II pla, + tA-R, wr, 


h—0 v=0 pv! 


‘ 


n\(n hes 1) bmS? k (vla,)”” 
os cue iantni ; S(p) = 1- 7 i 
(Fy SO) Fn = Hiss pl ae ee <a 
Hence the corollary.’ 

Corotuary 3. Let (4 --+ Xn) be a set of positive real numbers under a known 
condition a < 1 + +--+ + tn < b, where a (< b), b are two positive real numbers. 
Then, for any given polynomial f(x) = ao + -:- + axe", (a, ¥ 0), the mathematical 
expectation of the product f(a:) --- f(t») which we denote by E((ab), 0, [f]") ts given 
by the formula 
n\(n — 1)! pts@ _ gits@ 

b—a (nia) (1 + S@))-(n — 1+ S(qg))! 

ag? +++ (kl ay) 
qo! eee gx! 

Proor. Since the probability that the sum of 1,---, xz, takes a value 
between a and b is equally likely, we see that the required mathematical expecta- 


tion will be the mean of [ E(u, 0, [f]")du, that is 


(10) Z((ab), 0, Lf") = 


» (S@ =at --> + ky). 


E(ab), 0, y= 2 [Bu 0, Uf") au. 


5 This corollary can also be proved by means of Dirichlet’s integral. In fact, the right- 
hand side of (9) is given by the quotient of the two integrals: 


(SJ ... SFG) ... fGen) dar ... da) /V ... [dar ... Ee), 


the integrals being taken over the region: 21+ ... +2, =m,21>0,...,2%n>0. 
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The formula (10) is therefore obtained by integrating the right-hand side of (9) 
and dividing it by (b — a). 
On the other hand we find that 


lim E((a, a+ h), 0, if") _ E(a, 0, [f]”). 


This shows that the corollary 2 can be also deduced from 3. 
THEOREM 2. (A generalization of theorem 1). Let fi(x), --- , fn(x) be n given 
polynomials whose degrees do not exceed k. Then we have 


E(m, 1, [fil «++ (fal) 
( m+n-—1 ) 
"es S (ayy tS) + 0 = Fy Gan — DOV! 


? 
(vyesevp) € (Leven) (ni0sp) m—1 j=0 D;! 
n-—1 
where 


Joy---ng — fv, (2) + -*-+ fi. (x), S(p) = 1-pi + +9 + k-p;. 


Proor. In the proof of theorem 1 we have shown that 


Bm, 1, r) = (™— 1) sem, U1. 


Hence, by similar reasoning and lemma 4 we obtain 


E(m, 1, [fil a [fnl) 7 : t) S(m, (fil pan [fn]) 


n 


_ (—1)"* 

ae a m—l1 
“An -1 
m+n-—1 ) 


(-prt S@) +2 = 1 


(pyeeovele(le+sn) (nj0;p) a 1 
a- |] 


k (3))P 
[(fos---%, — or 
I] D; 


S(m, [fos e+ > f,1”) 


Theorem 2 is proved. 
Corotuary 1. Let 6 be a varying unit. Then 


m 
Bim fl = (3) +" “ ) 


vieeems)e(Leoem) (niOsp 
(p) +n—-1 


( ~ ) Ty Gee = DOPE 


n—1 i=0 p; 


(12) 
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where 








= fi(xd), (v = 1 --- n) 


Proor. This is almost trivial because of 


Elm, 0 (fale) =~ fe) = B((™), 1, [Gos] ~~ [alas)]). 


6 . di 
Coro.tuary 2.’ For any given positive real number m, we have 


gy(x) and Jvy---¥, = Gov, (x) yf vee Gv, (2). 







Pr)... fa) = pi! : Pai (n — 1)! mPrtectPn 
(18) Bom, 0, fo = fo) me Pet Pa UN pent 


Proor. By a passage to the limit 6 — 0, we - 










lim E(m, 5, [fil can (fal) = lim = a ~ Elm, 6, [fv > + fr,)") 











oni n—t ' “ 
7 a lim E(m, 8, [f,,---»1]")- 
nN: 6-0 


E(m, 0, [fil ss - (fal) — ae . aa — E(m, 0, [f>.-- ml )- 










The corollary is then deduced by (9). 
THEOREM 3. (Further generalization of theorem 1). Let (a; ---+ x,) be a set 
of arbitrary integers restricted to the conditions: 


w+ coe 


where m, a, b are all known integers. Then for any given polynomial f(x), the 
mathematical expectation of the product f(x1) --+ f(an) denoted by E (m, 1, [f]’) 
(a,b) 






+ 2%, = ™, a<2<b, 










is given by the following 


> (17 (”) som’, lta) 


(14) s * 1,if")== 


Eo \) 


where 

















g(x) =f(ie+b), h@) =f(@+a-—1), m’=m—(a—1)n+ (a—b—1)r 


Proor. Define 











0 for m> 0, 


S(m, [f!") = 0 form <nand S(m,[fl’) = 
1 for m= 0. 


6 This can also be deduced by Dirichlet’s integrals. 


We 


lA & 
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We shall show that 
2D (=1)" ~ Som’, ol) = fle) ++ Hen), 
a<z<b 


where on the right-hand side of the expression the set (x; --- 2,) under the 
summation runs over all different compositions of m into n parts with each 
a<au<b@=1---n). 

We denote the left-hand side of the expression by S, then by decomposing 
S(m’, Ig) [h]" ”) we have 


S= z r (~ »"(”) S(m, [fw + b)’)S(m' — m, [f(@ +a — II”). 


Let f(t) --- f(&,) be a term contained in G, i.e. %: + +--+ +2, =m, % > a,---, 


z, >a. And we suppose that z%, > b + 1,---,%,, > b+ 1, where »; wl Vj 
ifi ~ j. Then it is found that the number of occurrences of the term in © is 


given by 
0 if ¢2>1, 
Ev()-a-v-| rin 


s=0 1 if ¢=0. 


This shows that the term f(%:) --- f(%,) of S generally vanishes except a < 2 
<b. Hence we have 


S= Do f(x) --- flan). 
cxest. 
Next, we shall find the number of compositions of m into n parts with each 


a <2; < b,i.e. the number of terms in S. By the result just obtained we see 
that the number is given by 


reer) D1 ys 


“Eo gC-it 
- £ar()(¥-7). 


Hence the theorem. ; 

The theorem just proved shows that the mathematical expectation 
E (m, 1, [f]") can be expressed by S(m, [g]’) and is therefore expressible’ in 
(a,b) 


terms of the linear combinations of the coefficients of the polynomial f(z). 


CoroLuary 1. Let 6 be a varying unit for which sg : ; are all integers. Then 


E (m, 3, [fa)I") = E (*, L, (sez). 
= @) 


7 See lemmas 3 and 4. 





respectively. 


¥y=0,-+ +, P_mO 


Corotuary 2. Let f(x), --- 


-. (m, 6, [fil --- [fal) = 
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vyreeve)e(Le++n) 


Coro.iary 3. The number of compositions of m into n parts with each a; < 2, 


(¢ = 1--- n) is equal to 


(— 1)"*" ss+rn 


yy=0,-+ +r _=0 


m+n — (a, + +++ + an) + (1 — by — Ir + ++ + (Qn — bn — 1), - 1 


v==0 


vy=0 v,=0 


1,-+-,l 


vy y=0,---,v,=0 


taken from these stores in 


(— os ( 


er 
Hence the number of integer solutions of the equation 


Iu + -2° + kin, + ote tay + +++ + ton, —_ 


with each a, < x, < b(v = 1---s;n = 1--- n,) is given by 


See De (epee TI 


(mjl;m,-++m,) ¥1=0 


Proor. We have shown that the number of compositions of m into n parts 
with each a < x < bis given by 


> (-1)" 


8 


(mljm;) t=1 


(nen)... 


{” -l- > (a = ln + > (a = bi = 1) 
at---+a,— 1 

The corollary follows at once by putting n; = --- 

The last corollary may be restated as follows: Let there be n stores, 

-++, b, being the numbers of stocks contained in Ist, 2nd, --- 

Then m stocks containing at least a; stocks of the ith store can be 


m+tn—1—).a+ > (a—b — In 











> In(x) be n given polynomials. 





5, (m, 6, [fv llc + fl’). 


—(a—1n+(a-—b-—-1)y—-1 


m— (a; = 1)n; + (a; = b; = 1)»; -] 


8 _ 


oT i (a; > 1)n; + (a; = b; = 1)»; —] 


=n, =1,s=n. 


— different ways. 
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We have now established several combinatorial formulas concerning the 
mathematical expectations of the product fi(x:) --- fa(zn) under certain condi- 
tions. Apparently, there are many examples which can be solved by means of 
the results just obtained. For brevity, we may state a general criterion as 
follows: The mathematical expectation of a function, F(x, ---, %n) say, can 
be estimated by the above mentioned formulas, if and only if 1) the sum of 
%1,°°* , 2, iS known, and 2) there exist n polynomials fi(x), --- , fx(z) such 
that F(a ,*-- 2,) is proportional to f;(x:;), (¢ = 1--- mn). The undetermined 
quantities in (a1, --- , %,) may or may not be continuous, if the quantities are 
discontinuous, the varying unit is necessarily known. 


4. Convenient formulas for differences of zero.* 
Given f(x) = Bo + Bix + --- + Byx"(B, ¥ 0) we may write 


k k 
(f-1)” = 2d v!B, Sy. = Dd, BA’ 0", 


s= 


where S,,, is a Stirling number of the second kind, as used by Jordan and defined 
by 


v!S,. = 0’ = >> (-1)"" (") 2, 


z=0 
A’0’ being in the language of the calculus of finite differences, “‘a difference of 
zero’’. 


In terms of the differences of zero, the formulas (7) and (11) may also be re- 
stated as follows: 


ny _ (m+n — 1)!(m — n)Inl(n — 1)! 
ee Pin tn = S@SO) +m — HI (m— It 


k 
x IT Gao" + +++ + Bart)”. 
ve Py: 


Em itil I= 


1°*"ved 


(m+n —I)!(m—n)!(r— 1)! | 
(n50; p) (m — S(p))!(S(p) + n — 1)!(m — 1)! 


k 
x II = (B, 4’ 0’ 4+.) + B, A’ 0°)”’, 
verQ Pys 


where 
f(t) = Bot +++ + Bt’, Bu=Bit-->+Bnu, Sp) =1-pit+ +++ +hpe. 


§ The methods for obtaining convenient formulas for differences of zero as stated in the 
first part of this paragraph are similar to those used by Paul S. Dwyer in his paper ‘‘The 


computation of moments with cumulative totals,’? Annals of Math. Stat., Vol. 9 (1938), 
pp. 288-303. 
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The formulas (7)’ and (11)’ tell us that the difference of zero plays an important 
réle in the calculation of mathematical expectations of a polynomial product 
under known conditions. On account of this fact, we are now going to investigate 
some recurrence relations and approximations for the differences of zero. 

As m is larger than t, we may find a convenient recurrence relation as follows: 


A™ m+t 

mr ~ Some = MOTT) + molt a) + 
m+t 

(15) — ( ” ‘) 


Amomtttt m+t+1 
a Smim+t+1 = A(t + 1) (" i+ t) 


+ ult +1) isa + tree os '), 
















where 
¢+ IAA +7:A = A(t) = u(t + 1) = 1, 


and d2(t), --- , A(t) are all independent of m but depending on ¢. 
Starting with the first equation of (15) and using a well known relation (due 
to Jordan) Smnsi = Smin + m-Smn, successively, we get 





A(t + 1), Ao = Atgil(t) = 0, 
















SmimttH = a (m — v + 1)Sn—s41m4e41—9 


= Eno d(T Fmt 


-EMoE (Teper Jersey 
HEC} 
Eo titaeriso+ (tity 
. = > {t+ J)Aja + Ar} (” Te * 


t+j+ 


The recurrence relation thus follows. 

By successive applications of the relation \;(¢ + 1) = (£ + j)-Az-u() +9-A50, 
after nth time say, we may express ) ;(¢) as a linear combination of \,(t — n), ---, 
d,-n(t — n), but the coefficients are too complicated. 
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For t < 9, by applying the recurrence relation as obtained above, the coeffi- 
cients may be exhibited as follows: 


Ae(t) | As(¢) a(t) As(t) de(t) Az(t) As(t) do(#) 





3 

10 15 

25; 105 105 

56; 490 1260 945 

119} 1918 9450 17325 10395 

246) 6825, 56980) 190575) 270270) 135135) 

501) 22935) 302995) 1636635) 4099095) 4729725 2027025] 


| 1012; 74316 1487200) 12122110) 47507460) 94594500 91891800/34459425 


Cersanowe| = 


For example, when ¢ = 4 we have, according to the table: 


amon! = (" : ‘) 4. 25 (" : *) + 105 (” : *) + 105 (" : \\ ie 


$ ° ° +¢ 
We shall now proceed to find some approximations for Sy,.., and A”0"™. 
Firstly, we may write 


n qntt 


t+ 1 
According to the recurrence relation we have 
(i) A(t) = (2t — IA — 1), 

(ii) Ara(t) = At — 1)-Arw(t — 1) + @ — 1AM — J, 

(iii) A,-o(t) = (2t — 3)-Ava(t — 1) + (f — 2)-Ave(t — 1). 


Hence 


nl 2 


(24)! 


Az(t) = i. 2! . 


A(t) = (t — 1)!2°*e(); 


Nat) = 247 T - 2- Pe- 2-)-- 18)-0¢ -1- A, 


where 


o(k) = > ?. (t — 1.5); = (t — 1.5)(t — 2.5) --- @ —j — 0.5). 


pear Q2z x 


Evidently, the orders of i ' * i) ‘- ' + 4 a ainey iy 7 are all less than 2¢ as 


n tends to infinity. 
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Now, it can be easily found that 
so (*a ')~ ta (ar ') = G) a +5) 
x) 0+)0-2) 5") 
na) + + (5) 
as0(3, ; > His ')o- = ee < ) M( 
iO EGO) 
2 (37 3) =in() imran + OG) 


Hence, we may write 


(16) Saarve a(S) +e + Pp o(4 :), 








I 
















where . 
- aan a 4'-tt!o(t) 
a=t+2 (7) oft) =t+ “apr? 
1s oe ™ 4'-t1t!(2t — 1) 
p2 = —el(t — 1)(2t — 1) + — Qpr a(t) + ir aa Ar-o(t). 












Moreover, it can be shown by Wallis’ formula that 


4/24 <2 (7) < 4/2. (x = 1, 2, 3, --°). 
2° rg 
Thus we have 


1) = BE (") <Eg/E cf s/se= ee: 
o(t) = : x =) > ¥ 4/2 > [ 7 2 dz = wt (t — 2)! 


Again, by Wallis’ formula we have 


vais a(Z) <i. 
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Combining these inequalities we get 


ai (t — 2) < 4! [ o(t) < iV + @-, 


- : 2 2 
$Vt (t — 2)! ~ 32, W <i (-1)~#. 


Therefore, . 


t+ aViVvet—2<m<t+3vVvitiWWF—1), 1 ~ 3. 
Next, by Stirling’s formula 


taf") «ee a 4 i 
= (") Ven ! T jon t agen + o(4)} 
we obtain 


" (n™!* ~/2xn ott. f= — 2 
norte _ (n\' (m\ V2an A, O 1\\ on /2an 2 
_—* (°) (5) t! t + n ¥ n? ™ o(4)} e”.2'.¢! re =) 


where 


2 
=atyds~ i, 


= - + YaP1 + zee: 
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RANGES AND MIDRANGES'* 


By E. J. GUMBEL 
New School for Social Research, New York City 


1. Introduction. In the following the generating functions of the extremes 
are studied in order to determine the nature of the distributions of the ranges and 
the midranges. 

A large sample of size n is considened to be drawn from an unlimited synfmetri- 
cal continuous distribution with zero mean. The difference and the sum of the 
largest and of the smallest observation, the extremes, are called range and 
midrange. R. A. Fisher and L. H. C. Tippett [2] have established the limiting 
distributions of the largest and of the smallest member of a sample. The exact 
conditions under which these distributions hold have been given by R. von 
Mises [4]. For a normal distribution, L. H. C. Tippett [7] has calculated the 
numerical values of the mean range and the first four moments of the range for 
sample sizes varying from 2 to 1000. He has shown that, for sample sizes 
exceeding 200, the correlation between the largest and the smallest observation 
may be neglected. Later, E. S. Pearson [5] has calculated the probability 
function of the range for small samples (n = 2 to 20) taken from a normal 
population. These calculations are very laborious. Recently, W. E. Deming 
[1] has applied the range to quality control. 

The concepts “‘extremes’’, ‘‘range”’ and ‘‘midrange”’ allow a simple generaliza- 
tion. Let »x and x, be the mth observation in increasing and in decreasing 
magnitude, henceforth called mth value “from below” and “from above’”’. As 
long as the index m is small compared to the sample size n, the mth values under 
consideration are extremes. The difference and the sum of the mth extreme 
observations are called the mth range and the mth midrange. We will investigate 
the asymptotic distributions of the mth extremes, of the mth range, and of the 
mth midrange. Assuming that the number of observations is very large, the 
correlation between the largest and the smallest observation may be neglected. 
Then the mth range and the mth midrange are the difference and the sum of two 
independent variates, the mth extremes. 

It was found that the distribution of the mth range is skew and the distribution 
of the mth midrange is of the generalized logistic type, which is symmetrical. 
For m increasing the distributions of the mth extremes, the mth ranges, and 
the mth midranges converge toward normality. 


2. Generating functions of the mth extremes. Let ¢(x) be an initial con- 
tinuous symmetrical distribution with mean zero; let um be the most probable 
mth value from above; let a, be defined by 


(1) Om = — (Um). 


1 Research done with the support of a grant from the American Philosophical Society. 
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Under conditions given in an earlier paper [3] the distributions fn(1m) and mf(m2) 
of the mth extremes are, for large n and small m 


(2) Sul m) = om Gey 5 ft) = Sa — tm) 


where 
(3) Yu = Gn(Zn — Un) 


is a reduced extreme mth value. 
The moment generating function G,,(t) of the mth extreme value from above 
obtained from the preceding equations is 


+4. 
G (2) => ml etm! [ - emt ym(t/am)—me— vm a 
" (m 7 1)! co ™m™- 


Introducing 
e "= ¢ 


the integral becomes 


eo 
[ fs dz=T (m = +) mt lew 
0 Qm 


whence 


(4) Gn(t) = e*™ m7 (m — 4) / T'(m). 


To obtain the moments of the mth extreme value from above, the fundamental 
property of the Gamma function jis used; whence 


pel = - Am 


ae HFS = 
aay = dimstenn Ff p(y_ 2), 


Finally, by reversing the order of multiplication the moment generating function 
of the mth extreme value from above becomes 


(5) G,(t) = eht!@m) (umamttom) TI ( sam <.) r ( —_ +) z 


v= 1 Vm Am 


The mean Z,, of the mth value from above obtained from (5) is 
m—1 
(6) im = tm + 2 (tom - F 3 +0) 
Am pol V 


where c = .57722 is Euler’s constant. 
The seminvariant generating function L,,(¢) of the mth extreme value from 
above becomes from (5) and (6) 


t (= 1 = t t 
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and after expansion 


co t’ 
(7) Lad) = 2, > Som 

2 Vm 
where the sums 
(8) Bas — 2 = v = 2 
are obtained from the sums 
(8’) S&S = d+ 

kan Kk” 


which are known from the theory of the Gamma function. The numerical values 
of the seminvariants of the mth extreme value from above },,m, being the coeffi- 
cients of ¢’/y! in the expansion (7) may be calculated from a table of the sums 
S,, m given in an earlier paper [3]. 

From the generating functions (4) and (7) of the mth extreme value from above 
the moment generating function ,,G(¢) and the seminvariant generating function 
mL(t) of the mth extreme value from below are obtained by the symmetric 
relation (2) as 


and the mean ,,% of the mth extreme value from below is 
(6’) at = — &£,. 


The seminvariants Am,, and mA, of the mth extreme value from above and from 
below are linked by 


(10) Omr\m,» = (v — 1)!8,,m = (—1)"ammdA, 


The standard errors o,, and ,,0 of the mth extreme values are 
(11) nim = WSs.n = Amms. 


This procedure for obtaining the moments of the mth extremes from their 
distribution (2) parallels closely that used by R. A. Fisher and L. C. H. Tippett 
[2] for obtaining the moments of the largest and smallest value. The special 
case m = 1 of the formulae (4), (9), (6), (6’), (11), and (10) leads to the moment 
generating functions, the means, the standard errors, and the higher semin- 
variants of the largest and of the smallest value given by these authors. 

The two parameters um and am which exist in the distribution (2) of the mth 
extremes may be calculated from the observations by virtue of equations (6) and 
(11). Thus, the theoretical distributions (2) may be compared to observations 
even if the initial distribution g(x) is unknown. For increasing m, the distribu- 
tions of the mth extremes were shown [3] to converge toward normal distribu- 
tions, their means and standard deviations being given by (6), (6’) and 11. 








oda? ise © - ®@ 
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3. Generating functions of the mth range and the mth midrange. To obtain 
the characteristics for the mth range and the mth mid-range we state first some 
general properties of the sum and of the difference of two independent variates 
end y, with means < and 7 and standard deviations o, and o,. Let the distri- 
butions be g(x) and y¥(y), and let the generating functions of the seminvariants 
dz, and A,y,, be L(t) and L,(t). We write 


(12) v=x+ty w=2-y 


from the sum and the difference of the two variates. Then the means i and ® 
and the variances o2 and o%, are 


(13) D=E+9; D=F-GF; =or +o, = 0% 
and the seminvariant generating functions L,(¢) and L,,(t) are 
(14) L(t) = Lat) + L,@; Lut) = Lt) + L,(-2). 


The negative sign in the second equation (14) is obtained through the same well- 
known derivation as used for the first equation (14). Therefore, the even semin- 
variants of the sum are equal to the even seminvariants of the difference, whereas 
the odd seminvariants of the sum and of the difference are 


Aov2vtt = Azov4a + Ay.ov4s 3 Nvoret = Azores + (—1)" Ayers « 
If the distributions g(x) and ¥(y) are symmetrical one to another in the sense 
(15) vy) = o(—2) 


then the even seminvariants of the two variates x and y coincide in size and sign 
and the odd seminvariants coincide in size and differ in sign. Under the condi- 
tion (15), the even seminvariants of the sum v and the even seminvariants of the 
difference w are twice the even seminvariants of the variates x or y. The odd 
seminvariants of the sum v are twice the odd seminvariants of the variate z, 
and the odd seminvariants of the difference w vanish. 

We apply these properties to the mth extreme values and write z,, for x and »x 
for y. According to (2) the distribution of the mth extreme from above is sym- 
metrical to the distribution of the mth extreme from below in the sense (15). 

The mth range w,, and the mth mid-range v, are defined by 


(12’) Wm = Im — mX> Um = Im + ml. 


The mean @,, of the mth range, the mean 3, of the mth mid-range and the 
respective variances o;,, and o»,, are, from (6), (6’) and (11) 





‘ 2Se,m 
(13) @, = 22, ; dn = 0; Fim = 207, = 2,,0° = 3 = t. ; 


The seminvariant generating functions L,.(t, m) of the mth range and L,(t, m) of 
the mth mid-range obtained from (7) and (9) are 


(14) L(t, m) = s 37 Sum = 2m(t); Lot, m) = pa 


ye? Vi 


e om m 
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The seminvariants of the mth range are twice the seminvariants of the mth 
extreme values from above. The even seminvariants of the mth mid-range and 
the even seminvariants of the mth range are twice the even seminvariants of 
the mth extremes. The odd seminvariants of the mth mid-range vanish. There- 
fore, the distribution of the mth range is skew, and the distribution of the mth 
midrange is symmetrical. From the convergence of the mth extremes towards 
normality it follows that the mth range and the mth mid-range tend also, for 
increasing indices m, toward normality. 

The seminvariants of the range and the mid-range are obtained from (13’) 
and (14’) by putting m = 1 and omitting the index 1. Therefore, the standard 
errors o, and o, of the range and of the mid-range are 


(13”) AS» = Ai = V2 on = avy 


where a is given by (1) and o, stands for the standard error of the first or the 
last value. The skewness (:,, of the range and the excess 62 — 3 for the range and 
the mid-range 


(16) Biw = .64928 ; Bow — 3=12= Bee — 3 


are only one half of the corresponding characteristics of the largest value. The 
distribution of the range is less skew and less concentrated toward the middle 
than the distribution of the largest value. The moments (13) and (16) of the 
range and of the midrange may also be obtained directly from Fisher’s and 
Tippett’s results [2] when independence of the two extremes is assumed. The 
numerical values (16) for the limiting distribution of the range differ considerably 
from the values 


(16’) Biw = 809; Baw —3 = .54 


given by Tippett [7] for n = 1000 observations. For increasing n the approach 
of the distribution of the range toward its limiting distribution is very slow. 
This is reasonable as the approach of the distribution of the largest value toward 
the limiting distribution is also very slow. 

Until now we considered symmetrical initial distributions. The case of an 
asymmetrical initial distribution may be dealt with briefly. The most probable 
mth extreme value from below ,,u differs then from — wu» and 


(1’ ‘met = * (tt) 


differs from a». The distribution of the mth extreme value from below is for 
large n and small m [3] 


™ 


/ a m Mmy—memy 
(2’) afuit) = ma (m — 1! é 
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where 
(3’) mY = m2(mt — mu) 


is a reduced mth extreme value from below. The moment generating function 
wa(t) of the mth extreme value from below becomes 


(4’) ma(t) = em 'm “=p (m + £) / T'(m) 
and the moment generating functions G,,(¢, m) and G,(t, m) of the mth range and 


of the mth mid-range of an asymmetrical distribution obtained from (4), (4’) 
and in analogy to (14) are 


Gult, m) = elem mq)! Mem) + me) (m = +) r (m _ £) / rm) 
a ma 


G,(t, m) oni ef mt m4) yy Clam) — Cl ma)) 1 (m — +) r (m + £) [rm 
An ma 


Thus, the moments of the mth range and the mth mid-range may easily be com- 
puted even for an asymmetrical distribution. 


(17) 


4. The distribution of the mth midrange. In the following we return to a 
symmetrical distribution and establish directly the distribution f(v») of the mth 
midrange. Then, the generating function (14’) and the convergence toward 
normality will be verified. 

From (12’) the distribution of the mth midrange is 


f (Um) = ” Sm(Xm) mf (Um = Sun) Sq. 


Introducing (2), the equation is written 
2m +00 
m [ —My mtMA ny (0 m—Zm tm) —me— V m—me® m(%m—Zmtum) d 
co 


fm) = Om (m— 1)? Sen 


Using as before 


the integral becomes 


3 MA mV 
enim? m I gem emai tea mem) de = e ™(Q2m - 1)! 
0 m*"(1 + e*™’™)™ 


The distribution of the mth midrange is therefore 


(2m — 1)! " Bitel 


(18) f(Om) = Om (m — LP aq+te==)"" 
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The distribution (18) will be shown to lead to the seminvariant generating 
function (14’). The generating function of the mth midrange obtained from 
(18) is 


ee [ efimtmme (tam) Cy 4 gimPm\—2™ oy ay 
Introducing ; 
+e" ™ =u; du = —wWe™"™ dann 
the integral is rewritten 
(2m — 1)! 


1 
[ (1 ee qtr pees du 
0 


_ 2m) r(m ° x)r( _ = 


(m — 1)? 





T?(m) I'(2m) 
Consequently, the moment generating function of the mth midrange is 
r(m++)r(m- +) 
(19) G,(t, m) = oe rs 


This expression is identical with the product of the moment generating functions 
(4) and (9) of the extreme mth values. Therefore, equation (18) is the distribu- 
tion, and equation (14’) is the seminvariant generating function of the mth 
midrange. 

For m = 1 the distribution (18) of the mid-range becomes the so-called logistic 
distribution which is commonly written 


—av 


’ = ae 
(18’) fe) = a4+e™) 


Accordingly, (18) may be called the generalized logistic distribution. The proba- 
bility F(v) of a value equal to, or less than, v obtained from (18’) is 


(20) Fi) =1/a+e™). 
Therefore the distribution f(v) may be expressed by the probability F(v) through 
(18”) fv) = aF(v)\( — Fe)). 


Formula (20) considered as a growth function plays a réle in population 
statistics where it was introduced about 100 years ago by Verhulst [8]. Recently, 
it has been widely used by R. Pearl [6]. In his treatment, the value v stands 
for the time, and the function F(v) stands for the relative size of the population 
at time v compared to its alleged asymptotic size. 


fo ap tate . 3 
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In the following, the influence of m on the distribution (18) is studied. The 
distribution ¢,(z) of the reduced mth mid-range 


(21) Z = anim v 
is, from (18) 
(2 nd = 5 eS 


Vim (m — TP (1+ evn) 


The probability density of the mean mth reduced mid-range increases with m. 
Indeed, 


m+1(0) = 2n+ 1 
$m (0) 2 m(m + 1) 


Therefore, the standard error of the reduced mth mid-range decreases with 
increasing index m. This is reasonable as the mth mid-range is an estimate of 
the median for the initial distribution. The larger m, the nearer are the mth 
extreme values to the median, and the better is the estimate. 

To verify that the distribution (18) of the mth mid-range tends toward nor- 
mality for large indices m equation (22) is rewritten 


z/Vm 
lg bm(z) = lg dm(0) + 2r/m — 2mlg (te), 


Expansion of the exponential and the logarithm leads, if we neglect the third 
and higher powers of the deviation z, to 





FY 2? 2? 
lg dm(z) = lg ¢m(0) + 2v/m —~ 2m (= + dn a o—_* ++) 


4m 


whence by virtue of (11) 
—a* mv" /4 
five) = Const enn", 


The distribution of the mth mid-range becomes normal for large indices m. 
The mean is zero, and the standard deviation is 


(23) ~ 


This is in accordance with the statement (13’), as Se, tends with increasing m 
toward 1/m. 


5. Summary. For initial symmetrical distributions the seminvariant gen- 
erating functions (14’) of the mth range and the mth mid-range are obiained 
from the seminvariant generating functions (7) and (9) of the mth extreme 
values from above and from below. As the two mth extreme values are sup- 
posed to be independent our results hold only for very large, sample sizes. The 
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even seminvariants of the mth mid-range and of the mth range coincide. The 
distribution of the mth range is skew and the distribution of the mth mid-range 
is the generalized symmetrical logistic distribution. For increasing indices m 
the distributions of the mth extremes, the mth ranges, and the mth mid-ranges 
converge toward normality. 
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NOTES 


This section 1s devoted to brief research and expository articles on methodology 
and other short items. 
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NOTE ON ASYMPTOTIC VALUE OF PROBABILITY DISTRIBUTION OF 
SUM OF RANDOM VARIABLES WHICH ARE GREATER THAN A 
SET OF ARBITRARILY CHOSEN NUMBERS 


By Braprorp F. KimBa.u 


New York Public Service Commission, New York City 


The purpose of this note is to present the following theorem which the author 
needed in connection with a computational problem. Since the theorem has 
general implications for statistical theory which do not seem to have been 
brought out heretofore, readers of this journal may find it of interest. 

THEOREM: In Euclidean space of n dimensions with coordinates x; (i = 
1, 2, --- n) let a; be n constants whose sum is uo, and consider a hyperplane 


(1) M+ wet: +t = 4, U> UD. 
Let k denote a positive constant, and I(u) the n — 1 fold integral defined by 


(2) I(u) = | ee | exp. | -a(3, “)| da, dx, +++ dXn-1 


taken over that part of the hyperplane for which x; > a;. Then 
(3) lim v/n ” iia I(u) ie (x/k)"-??, 


uo 


Proor: The integral may be reduced to another integral by reduction of the 
quadratic form 
n —1 n—1 2 
> 2 = Dit (u- > x.) 
t=1 i=l i=1 
to a sum of squares y; such that y; does not involve x; for j <i. Dropping the 
superscript n — 1 in the notation for = and letting the subscript on = denote 
the least value of 7 involved in the sum, the expansion of this form may be written 


2>>23 -—-Qudxant+2> 22;4+ v, d < Z. 
1 1 1 
The transformation may be performed progressively as follows: 
(V2 m1) + 2 (tat ++ wat ((u- Li 2i)/V2P = vi 
giving 
y=V2u— (u- X x:)/V2. 


423 
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Using the remainder of the terms involving x2, one can complete the square 
with terms not involving x2, and the value of y-2 is 


Y2 = Xe V3/V2 -@- > «i)/V6. 


3 


Continuing until only terms involving z,_; and u remain, 
Yn = Ini Vn/YVn — 1 —- u/S n(n — 1). 


The remaining term in wu will be found to be u’/n. 
Making the above transformation, the integral becomes 


(4) I(u) = eval | eee [ ex.[ -# (= s’) | dy: dy2 +++ dyn. 


In order to fix the limits of integration on y; it will be noted that the projection 
of the critical region of the hyperplane (1) upon the n — 1 dimensional space in 
the original variables x; delineates a region in that n — 1 space bounded by the 
nm — 1 hyperplanes 


Y= a, 
and the hyperplane 
Mtetrwmte: $i = uU—a. 


Hence, if (2) is considered as an iterated integral with the integration per- 
formed in the order of the subscripts of x; , the intervals of integration are 


ay Su-—a— > 2: 
2 


2 te SU—-aq—a— >) % 
3 


<u-Dox- (uo — >) as) 
r+1 


r+l 


Qn-1 S Xn SU — Uy + On ? 
n—1 


where it is recalled that >» denotes >> 
ers 


t=r+1 
Now transforming to y; and using the general transformation equation 


(5) Yr =2r-Vr + 1/Vr - (u— Lo ai)/Vo + Dr, l<r<n-l 
where > x; = 0 under definition of summation symbol noted above, 

—_— limit of y, = — (u — d ti)/V (ir + Drtadr+i/vVr, 

Upper limit of y, = (u — dX ti) Vr/Vr +1 -— (wo — a, ai) Vr + 1/vr, 


since 


Vi Fir — VG$ Dt = V/V FL 
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‘It is not difficult to show that 
2d, ti = Crarlys) + (n — 1 — r)u/n 


where C’,:(y;) denotes a linear combination of y; for i > r + 1, which does not 
involve the variable u. In other words 


u— De = (r+ l)u/n — Crarlyi). 


Making this substitution, the limits on y, are found to be 
Lower limit of y, = —(u/n)(Wr + 1/+/r) 
+ Craalys)/V@ FI + arr + 1/Vr, 
Upper limit of y, = (u/n)(Wr(r + 1) — Cray) r/Vr + 1 
— (wo — D0 air + 1/Vr. 


r+1 


It will now be clear that as wu becomes infinite, the limit of the integral in (4), 
will be the n — 1 fold integral taken over the whole of n — 1 space. This latter 
integral is easily evaluated to give (1/k)‘»”, and the theorem follows. 

The following two corollaries, which are restatements of the theorem in less 
general form, bring out the implications for statistical theory. 

CoroLuARY 1. With k = 1/2 define F,(u) by 


(6) F,(u) = (24)""I(u). 


The differential F,,(u)du represents then the probability that n random variables x; 
taken from a normally distributed population with zero mean and unit standard 
deviation, fall into a region 


(7) a; < Ls 


and have a sum with value in the neighborhood (+3 du) of u. 
Recalling that w% is the value of w when each 2; has value a; 


(8) Pla: < 2) = [ Fa(u) du 


is the probability that all values of x fall into the region (7). Denoting the normal 
probability function by $(t), corollary 1 implies that 


(9) lim Vn F,(u)/o(u/Vn) = 1. 


Since $(u/A/n)du/+/n represents the probability distribution of the sum of 
the n random variables when the condition (7) is removed, certain implications 
for the theory of statistics emerge. One of these is noted in the example given 
below. Corollary 1 can be stated in a different form as follows: 

Coro.tuary 2. If pa(u)du denotes the probability that the sum of n random varia- 





426 BRADFORD F. KIMBALL 


bles from a normally distributed population be in the neighborhood (+4 du) of u, and 
Pa(u) denotes the probability that the sum of n random variables from the same 
population that do fall in region (7) have a value in this neighborhood of u, it follows 
from (9) and the nature of the functions integrated that for arbitrarily small positive 
5, a value of u, say u’ can be found, sufficiently large, such that for allu > w’, 


(10) (1 — 6) pa(u) < pa(u) Plai < xi] < palu). 


Rate of convergence. The author having had occasion to compute F,,(u) for 
values of n from 2 to 5, and a; = 0, a table showing the rate of convergence of 
F,,(u) to its limit for this range of n (and a; = 0) is shown below. In this table 
the values of the ratios of the minimum values of u (= wu’) to the standard devi- 
ation of u (= +~/n) are shown for a sequence of values of 6 which approaches 
zero. 


Rate of Convergence of ~/nF,(u)/o(u/n) to Unity for a; = 0 
Critical Ratio u'/~/n 

6 n=2 n= 3 n= 4 
a) 0.67 1.34 1.92 
25 1.15 1.95 2.65 
re 1.64 2.60 3.40 
.05 1.96 3.02 3.89 
.O1 2.58 3.85 4.87 
.001 3.29 4.84 6.03 


ll 
on 


46 
27 
11 
.65 
.76 
.08 
Example. An example showing the possible bearing of this theorem upon 
practical considerations of sampling is the following: If in a quality control 
problem, samples of size 4 were used, and a follow-up of samples which showed a 
large deviation of sample mean were pursued, a case of a particularly large devi- 
ation such as 4 would possibly receive special attention. With sample mean 
equal to u/4, the ratio of this mean to its standard deviation is u/2. Turning 
to the table, in column n = 4 it will be noted that the value of 6 for w’/2 = 4is 
somewhat less than .05. Hence from (9) and (10), for u > w’ (u’ = 8, n = 4), 


95 b(u/~/n)/Vn < Fru) < (u/Vn)/Vn. 


SIo mR Bob § 


It follows that 


95 [ ¢(u/Yn) du/V/n < [ F,(u) du < [ o(u/+/n) du/V/n- 


If one now considers a set of random samples of size 4, the last integral on the 
right represents the expected: proportion of the set which falls into the sub-set A 
for which u > w’ (and hence with deviation of mean relative to standard devi- 
ation of mean greater than u’/+/n). The middle integral represents the expected 
proportion of the original set which falls into a sub-set B for which u > wu’ and 
x; > 0. It follows from the inequality on the left that the expected proportion 
of the sub-set A which falls into the sub-set B is greater than 95 per cent. Hence 
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one infers that the probability is greater than .95 that for a sample showing such 
a large deviation from the mean (u/+/n = 4, n = 4) all the constituent elements 
will have deviations on the same side of the population mean. Thus if all the 
elements of the sample investigated are found to have deviations on the same 
side of the population mean, this could not be construed as additional evidence 
that the sample indicated an abnormal condition. 

This conclusion is weaker than the facts of the example warrant, since it is 
based upon the integral of F,,(u) from wu’ to infinity. Unfortunately the author 
does not have data available on the rate of convergence of these integrals. 


NOTE ON A MATRIC THEOREM OF A. T. CRAIG 


By Haroip Hore..ine 
Columbia University 


An extremely elegant theorem given recently by A. T. Craig’ and applied 
by him to establish a further theorem on independent x’ distributions may be 
stated as follows: 

If A and B are the symmetric matrices of two homogeneous quadratic forms in n 
variates which are normally and independently distributed with zero means and unit 
variances, a necessary and sufficient condition for the independence in probability 
of these two forms is that AB = 0. 

The proof given that the condition is sufficient is adequate, but Craig’s 
treatment of its necessity consists essentially in its assertion. In view of the 
growing interest in such quadratic forms, for example in connection with serial 
dorrelation, the neatness of this theorem is likely to lead to a wide usefulness. 
It therefore seems worth while to give a complete proof of the necessity condition. 

The form with matrix A is denoted by Q; and that with matrix B by Q. 
The characteristic functions, if defined as Ee*! and Ee’, are respectively the 
reciprocals of the square roots of the determinants of the matrices 1 — AA and 
1 — uB, while the characteristic function for Q; and Q2 together, Bette) | 
is the reciprocal of the square root of the determinant of 1 — AA — wB. A 
necessary and sufficient condition for independence is therefore that 


|1—AA|-|1—uB| =|1— AA — pB| 
shall hold identically for all values of \ and yw. Since the determinant of the 


product of two matrices is the product of their determinants, the left member is 
the same as 


|1— AA — wB+ AB. 


From this it is immediately obvious that AB = 0 implies the independence of 
the two forms. The converse will now be proved. 


1 “Note on the independence of certain quadratic forms,’’ Annals of Math. Stat., Vol. 14 
(1943), pp. 195-197. 
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We shall assume therefore that Q; and Q, are independent, so that the identity 
holds, and prove that AB = 0. 

It must not be supposed that Q, and Qe can by the same linear transformation 
be reduced to forms in which product terms are absent and only terms in the 
squares of the variates appear. The available theorems’ leading to this canonical 
form require that at least one of the quadratic forms be non-singular. But it is 
of the essence of the present situation that both Q; and Q, be singular, since this is 
implied by AB = 0. It does not appear possible, for example, to reduce to this 
canonical form the pair x} + x3 and xz} + 223. 

Nevertheless a real orthogonal transformation can be found reducing Q, to 


dai + --- + d,2;, 


where r is the rank of Q;. Thus there exists an orthogonal P such that A = 
PLP™ and B = PMP™, where L and M, when partitioned so as to separate the 
rows and columns into successive groups of r and n — 1, are of the forms 


[elr |: 


Here D stands for an r-rowed diagonal matrix having d; --- d, in its diagonal, 
0 for various matrices whose elements are all zero, and E, C and F for arbitrary 
matrices of appropriate dimensions. Then 


}1—AA| =|P(1—dAL)P'|=|P|-|1—-AL|-|P*| =|1-aAL|, 
and in the same way, 
|1—uB| =|1— uM | 
and 
|1—AA — wB| =|]1-—-AL— pM |. 
We thus have 
[1-—-AL|-|1—~M|=|1—-—AL—ywM |. 


From this identity it follows that a pair of forms Qi and Q> , quadratic in a set of 
variates normally and independently distributed with zero means and unit 
variances, and having matrices L and M respectively, are independent. 

Since AB = PLMP™", the theorem will be proved if we can show that LM = 0. 


Let 


so that M = M,+ M2. Since obviously LM2 = 0, we need only to show that 
LM i= 


2 E.g., Bécher, Introduction to Higher Algebra, pp. 169, 305. 
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Let QF = Q’ + Q”, where Q’ and Q” are quadratic forms in the particular 
normally distributed variates considered, and have matrices M, and Mz respec- 
tively. Since Q” obviously does not involve any of the variates occurring in 
Q; and since all the variates are independent it follows that Q1 is independent of 
Q”. Since it has been shown above that QT is also independent of Q: , it must be 
independent of the difference Q; — Q” = Q’. Therefore 


}1—AL|-|1—uM,| =|1—AL— uM, |. 
We have: 


[1 — AL| = II a - ip. 


ili ~ [Lowes |=] 
1 = a= a = [2 = 2 A=} 


Consequently equating the terms of highest degree in \ on the two sides of the 
identity 


(1 — Adj) | 1 — pM, | =|1— AL — wpM, | 
yields the identity in y, 


}1-—4M,|= 


? 


or upon putting np = 1/z, 
| M, — x| = (—2)”. 


Hence all the latent roots of the real symmetric matrix M, are zero. Now for 
a symmetric matrix the sum of the squares of the latent roots equals the sum 
of the squares of the elements, since both equal the trace of the square of the 
matrix. Therefore M,; = 0. Consequently LM, = O and the theorem is 
established. 

Since M, = 0, the following further result is obvious: 

Two independent quadratic forms in a set of variates normally and independently 
distributed with zero means and a common variance can by a transformation be 
reduced to two forms having no variate in common. 

But one of the disjunct sets of variates in the forms as thus reduced is not 
necessarily independent of the other set. For example, if 21, 22, %3, 24 are 
normally distributed with equal variances and any fixed non-vanishing correla- 
tion, the same in each of the six pairs, the sets (a , x2) and (x3 , 24) are not inde- 
pendent of each other, but the forms (a + 22)” and (x3 — a4)” are, since 2; + 22 
is uncorrelated with x3 — x4. 

I am indebted to Professor E. R. Lorch in connection with the preparation 
of the present note. 
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A NOTE ON THE BEHRENS-FISHER PROBLEM 


By Henry ScHEFFE 


Syracuse University 


A commonly occurring problem of statistical inference is the comparison of 
the means of two normal universes when the ratio of their variances is unknown, 
Let (%1, --- , 2m) be a random sample from one normal population with mean q 
and variance yu, and (y:, --- , yn), a random sample from another with mean g 
and variance v. The problem is then that of making statistical inferences 
about the difference 6 of the means, 6 = a — 8, when the ratio u/y is unknown, 
Convenient tests and confidence intervals are available if one can find a linear 
form L and a quadratic form Q in the vector (1%, --+ ,2m, Y1,°** , Yn) With 
coefficients independent of the unknown parameters a, 8, u, v, such that for some 
positive integer k, the quotient 


(1) (L — 8)/(Q/k)* 

has the ¢-distribution with k degrees of freedom. For this it is sufficient that the 
following conditions be satisfied for all values of the parameters: (7) L and Q are 
independently distributed, (ii)' E(L) = 6, (iti) Q/o” has the x’-distribution with k 
degrees of freedom, where o” is the variance of L. 

In a recent paper [1] the author investigated the Behrens-Fisher problem as de- 
limited by the above three conditions,” and among other results arrived at a simple 
solution. This solution however does not have the property that the quotient 
(1) is symmetric, whereby in this note we shall mean the following: A function 
of the samples and parameters will be called symmetric if it is invariant under 
permutations of the z’s among themselves and of the y’s among themselves, 
Let us therefore formulate condition (iv): the quotient (1) is symmetric. Since 
(iv) would be extremely desirable, both for practical and theoretical reasons, 
and since the author has received several inquiries on this matter, it is considered 
worth while to outline a proof that conditions (77) and (777) imply that (iv) cannot 
be satisfied, in other words, there exists no “symmetric solution” of the Behrens- 
Fisher problem within the framework we have imposed. Perhaps this is a 
simple example of a larger class of problems in which an approach, natural in the 
light of past developments, forces us to an asymmetric solution. 

Suppose (iv) is satisfied. By substituting special values for the vector (2, 
“++, 0m, Y1,°**, Yn) and then making permutations allowed by (iv) we find 
that L and Q must be of the form 


(2) L=akauteadiy, 
i 7 
(3) Q=ed tites x, xxi + cs 2. yi + cs 2d yiys + er Dy iY; 
t ix 7 ii’ ts) 


1 E(f) denotes the expected value of f. 
2 Although these conditions appear simpler than those in [1] they may be shown equiva- 
lent. 





BEHRENS-FISHER PROBLEM 431 


where all c’s are independent of parameters, and the range of indices 7, 7’ is 
from 1 to m, the range of j, 7’ from 1 to n. 
Condition (zz) requires that 


(4) E(L) = a-— B. 
But from (2), 
(5) E(L) = cyma + conB. 


Since (4) and (5) must be satisfied identically in a, 8, it follows that c, = 1/m, 
¢ = —1/n, and thus the variance of L is 


(6) ¢ = p/m + v/n. 


Because of condition (iii) we must have E(Q/o’) = k, and combining this with 
(6), we have 


(7) E(Q) = k(u/m + v/n). 
However, from (3), 
(8) E(Q) = cym(u + a”) + ca(m? — m)a’ + egn(v + 8’) 
+ c6(n” — n)8° + cymnaf. 


Kquating (7) and (8) gives us an identity in a, 8, u, v from which we can determine 
the c’s, and after putting these back in (3) we find that the result may be written 


(9) Q = K{S./(m’ — m) + 8,/(n* — n)], 


where 


S.= ) (1 - 9, #=)) 
Ss, = X y;-9, G= Dy;/n. 


x;/m, 


2 

The last step of the proof consists of showing that Q defined by (9) violates 
(iii). Write u = S./p, uw = S,/v. Then uw and wz have independent x7- 
distributions. Now (9) states that u = Q/o’ is of the form u = ayw, + are . 
where a; and a, are constants. Let (t), o:(t), ¢2(t) be the respective characteristic 
functions of u, uw, U2. Then o(t) = d;(at)de(aet) because uw, and wz are statisti- 
cally independent. Since the characteristic function of a x’-variable with r 
degrees of freedom is (1 — 2it)~”, it is evident that wu has a x*-distiibution if and 
only if a, = a = 1. From (9), 


ai/az = [»/(m* — m)}/[v/(n* — n)}, 


and thus a necessary condition (it is also sufficient) for Q/s° to have a x’-dis- 
tribution is 


(10) u/v = (m° — m)/(n’ — n). 
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But (iii) states that Q/o’ has a x’-distribution for all parameter values. This 
contradiction completes the proof. 

We remark in closing that we have at hand a counter example of practical 
interest to the statement found in several statistics texts that if z is a normal 
variable with zero mean and » is an independent unbiased quadratic estimate of 
the variance of z, then z/ v' has a ¢-distribution. The counter example consists of 
taking z =  — 7 — Sand v = Q/k defined by (9). It may be shown that z/y! 
does not have a ¢-distribution except in the trivial case (10). 


REFERENCE 
{1] H. Scuerr&, Annals of Math. Stat., Vol. 15 (1943), pp. 35-44. 


ON MULTIPLE MATCHING WITH ONE VARIABLE DECK 
T. N. E. GREvVILLE 


Bureau of the Census 


The problem of card matching has been considered by a number of writers, 
A complete bibliography has been given by Battin [1], who also published the 
most general treatment of the subject to date, dealing with the simultaneous 
matching of any number of decks of arbitrary composition. He considers, 
however, only the case in which the order of every deck is variable, all possible 
permutations being equally likely. Some interest attaches to the case in which 
all the decks but one have fixed orders in relation to one another, especially in 
connection with radio experiments in telepathy, where a large number of subjects 
simultaneously attempt to call the same target. 

The simplest case is that in which the target for each trial is chosen at random, 
independently of the other trials. If the target is to be selected from s possi- 
bilities, and if p; denotes the probability that the 7th possibility will be chosen 
as the target, while m; denotes the number of subjects who call the ith possibility, 
then the mean value of h, the number of correct calls is, of course, 


(1) M, = 2d Pim , 

and the variance is 

(2) Vi-= D pimi — Mi. 
t=1 


Evidently, the mean number of hits for a succession of trials is the sum of the 
means for the individual trials, and the variance is the sum of the variances. 
A slightly more difficult problem is presented when the target series is a true 
“deck’’: that is, when its composition is determined in advance, only the order 
being left to chance. Let n denote the number of trials and n;(i = 1, 2, --- , 8) 
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the number of targets of the 7th kind so that - n;i =n. Also let m;;(7 = 1, 2, 
- , n) denote the number of subjects who call the ith possibility on the jth 
trial, and let m; denote the total number of such calls in all the n trials, so that 
; Then, using the ingenious type of counting function introduced 

by Wilks [6] and improved by Battin [1], we define 


d= <I (= ne"), 


j=1 t=1 


where N = n! / [I ni!. We also define an operator K,, such that if u = u(a, 


i=] 
t, ***,%s), then K,,,u is the coefficient of ri'rz?--- 2st in u. Then, if h 
denotes the number of hits 
P(h = r) = coefficient of e” in K,,®. 
Also, 
E(i") = Ke, 2]. 
"* 00? | mo 


It follows immediately that 
(3) M, = =D mam = 2. Pim: , 


where p; is written for n:/n. Similarly, it can be shown that 


ls/ 2 1 < 
E(h’) = = dX \r 2 mis) + en De hee ~ i) & mma 


1 8 n 
+ nn — 1) 7 ee 2, mama | 


é,jut 
ixj kyl 


If we write \i; = >> mum, it is found that 
k=l 


, 1 . : 
(4) VY. = aioe a bP Di(ndis — mi) ae a, Di pi(Mri5 — mim) |. 
ad .2= 


It should be noted that it is not necessary that the total number of subjects be 
the same for each trial. 

Certain special cases are of interest. When there is just one subject for each 
target, as in the ordinary matching of two decks, \;; = m; for 7 = j, and 0 other- 
wise;and ym; = n. Then, if 7; denotes m;/n and p; = 1 — m;, andif gq; = 1 — pi, 


2 
(5) Mna=ndipirs and Vi = — [Zspigemips + Dai Pipi, 
a & 
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a compact expression which is equivalent to the forms previously given by 
Stevens [4, 5], Greenwood [2], Battin [1], and the author [3]. 

On the other hand, if the different kinds of targets occur with equal frequency, 
so that every p; = 1/s and every gi = (s — 1)/s; and if »% = Dyn and » = 
Dw: = L,v,, the expression (4) becoraes 


(6) Vi = 3 [? — n2evi + sdi(ndix — mi). 
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BOOK REVIEW 


“Student’s’”’ Collected Papers. Edited by E. 8. PEarson and JoHN WIsHART. 
Biometrika Office, London. pp. xiv + 224. 1942. 15 shillings. 


Since the question is still asked from time to time, it may be well to state that 
‘Student’ was the pen name of William Sealy Gosset. After taking his degree 
in mathematics and natural sciences at Oxford, Student worked as a brewer for 
the well-known firm of Messrs. Guinness—in Ireland from 1899 till 1935 and 
thereafter in London as head of the new Guinness brewery established there. 
He died in 1937 at the age of 61. Between 1907 and 1937 he published twenty- 
one papers and a few notes; these are issued in the present volume as a tribute 
from a group of his relatives and friends. 

Student’s name is almost universally attached to a single discovery, the t-test, 
which requires for a complete proof more powerful mathematical methods than 
he devised. These facts may cause his papers to be regarded as museum pieces 
by people who have not read them. Actually, in many respects no better model 
than Student could be suggested for a young statistician today. Since we some- 
times speak derisively of courses of lectures in statistics where the methods and 
ideas are ‘‘20 years out of date’’, it is interesting to find that many of the funda- 
mental ideas considered most in need of emphasis by forward-looking teachers 
today were in fact emphasized by Student in his writings over 30 years ago. 

For example, his classical paper on the ¢-test, published in 1908, opens as 
follows: ‘‘Any experiment may be regarded as forming an individual of a ‘popu- 
lation’ of experiments which might be performed under the same conditions. 
A series of experiments is a sample drawn from this population. 

Now any series of experiments is only of value in so far as it enables us to form 
a judgment as to the statistical constants of the population to which the experi- 
ments belong”’. 

The idea is elaborated in a second paper published in the same year. ‘Note 
that the indefinitely large population need not actually exist. In Mr. Hooker’s 
case his sample was 21 years of farming under modern conditions in England, 
and included all the years about which information was obtainable. Probably 
it could not actually have been made much larger without loss of homogeneity, 
due to the mixing with farming under conditions not modern; but one can im- 
agine the population indefinitely increased and the 21 years to be a sample from 
this.” We note here a further observation, not always appreciated, that in some 
lines of research samples which are both large and homogeneous do not exist. A 
clear conception of the relation between sample and population is evident in all 
his work: one further quotation, the summary from his 1926 paper, Mathematics 
and Agronomy, will suffice. ‘To sum up, in planning agronomic experiments use 
plenty of replications and make quite sure your results are capable of being con- 
sidered to be a random sample of the population about which you wish to draw 
conclusions.” 
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Another noticeable feature of Student’s work is his concern about the extent to 
which the mathematical model underlying his statistical techniques really repre. 
sents the facts of the data. Further, he is always ready to give his opinion, from 
considered thought, as to whether the discrepancies between the model and the 
facts will affect the conclusions materially. In his first paper, On the error of 
counting with a haemacytometer, he shows that the Poisson distribution applies to 
the total number of cells or corpuscles which are found in a division of the hae. 
macytometer field. He is careful, however, to point out three ways in which 
deviations from the Poisson law may occur in practice: (i) there may be inter- 
ference between the particles, though this is considered unlikely in high dilutiong 
(ii) there may be clumping and (iii) (probably the most important, in his view) 
the drops taken out for counting may not represent the bulk of the liquid, giving 
rise to an additional sampling error superimposed on the Poisson error. Ina 
later paper An explanation of deviations from Poisson’s law in practice (1919), he 
makes a critical examination of the assumptions necessary for Poisson’s distribu- 
tion, and shows the type of distribution to be expected from the invalidity of one 
or another of the assumptions. 

Similarly, in the opening paragraphs of the 1908 paper on the ¢-test, he notes 
that the assumption of normality in the parent distribution may not hold in 
practice. Then follows his opinion on the importance of this assumption: “It 
appears probable that the deviation from normality must be very extreme to lead 
to serious error.” 

His earliest papers deal with attempts to find several exact small-sample dis- 
tributions for which he and other workers in Britain felt a need. Lacking the 
mastery of probability theory necessary for obtaining a rigorous solution, he used 
a combination of mathematics, intelligent guesswork and inferences from repeated 
samplings which he drew. For example, the steps in his development of the 
t-distribution were as follows: 

(i) The distribution of the mean square, s?, being unknown to British statisti- 
cians at that time, had first to be found. Student calculated the first four 
moments by algebraic methods. Noticing that the condition for fitting a Pear- 
son Type III curve (2 82 — 3 8: — 6 = 0) held exactly, he inferred correctly that 
s? followed a type III curve. 

(ii) He was aware that the joint distribution of s* and the sample mean Z was 
necessary for a solution of the problem and further that the form of the joint dis- 
tribution depended on the correlation between Z and s?. Having shown by alge- 
bra that s? is uncorrelated with either Z or Z?, he assumed (without further com- 
ment) that the distributions of < and s? were independent. 

(iii) The final step to the ¢-distribution was performed rigorously. 

(iv) The result was then checked against empirical data. He used the heights 
of 3000 criminals, which provided 750 samples of 4. Another set of 750 samples 
was supplied by the lengths of the left middle fingers of the same criminals. 

In 1908 Student also published the distribution of the sample correlation 
coefficient r, in a bivariate normal population in which p is zero. His interest in 
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the problem arose from a meeting of the Royal Statistical Society, in which an 
inquiry was raised about the significance of correlation coefficients derived from 
small numbers of cases. The authorities present, Yule, Hooker and Edgeworth, 
ventured their opinion on a specific instance, but had no general answer. Ac- 
cordingly, Student set out to find the general distribution of r, without assuming 
pzero. His mathematics did not take him far; providing the solution only for 
samples of 2, where the frequency is zero except at —1 and +1. In order to 
learn what empirical results could teach about the form of the distribution, he 
then drew four sets of 750 samples, the sets having p = 0, n = 4; p = 0,n = 8;p 
= .66,n = 4; and p = .66, n = 8, respectively. 

He considered first the set having p = 0,n = 4. Now, for samples of 4, the exact 
distribution, C(1 — eos reduces to a rectangular distribution. For his 
purpose, it would seem that Student had been unlucky in his choice of n = 4, 
since the rectangular distribution appears to furnish little or no clue as to the 
general form of distribution when n differs from 4. Actually, things could not 
have turned out better. Referring, as in the case of the distribution of s?, to 
Pearson’s curves, he selected a type II on account of the limited range of the dis- 


r 0.272 ; 
. This sug- 


tribution of r. The fitted curve was found to be o(1 — — 


1.076 
gested to Student that the true form ought to be C(1—7*)°, giving a rectangular 


distribution ; the general form was guessed as C(1 — r?)". The samples of 8 
were then used merely as corroborative material. One wonders how Student 
would have fared, both in this case and in the case of s?, if the Pearson system of 
curves had not been familiar to him. The method did not of course provide the 


much more complex distribution when p is not zero; however, Student was able 
to write down several important properties of the distribution for the guidance 
of mathematical statisticians who might be interested in the problem. 

By similar methods he attempted (1921) to find the frequency distribution and 
the efficiency of Spearman’s rank correlation coefficient. Much later (1936), he 
remarked in connection with this coefficient that, though it was not fully effi- 
cient, ‘it was so simple that when playing with other people’s figures, for instance 
on a railway journey, it was the obvious one to use.” 

His paper, The elimination of spurious correlation due to position in time and 
space, though brief and incomplete in its treatment of the problem, established 
him as one of the pioneers in the use of the variate-difference method. His sug- 
gestion was to correlate the nth differences of x with those of y for n = 1, 2, --- 
until a point was reached where the correlation ceased to change. This limiting 
correlation would be free from spurious effects due to trend factors. Such 
methods, he remarks, help to show “whether there really 7s a close connexion 
between the female cancer death rate and the quantity of imported apples con- 
sumed per head.” 

His association with field experiments conducted in Ireland for the purpose of 
developing high yielding varieties of barley suitable for brewing led to an interest 
in the design of experiments, to which he devoted seven papers. His philosophy 
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on this subject, though it appeared to be increasingly out of harmony with the 
philosophy underlying Fisher’s methods, was a natural outgrowth of Student’s 
general attitude towards the applications of statistical techniques. He believed 
that, from general experience of soil fertility patterns plus local knowledge of the 
history and topography of the particular field in question, one could construct g 
systematic design which would be more accurate than the most appropriate 
randomized design. Secondly, although systematic designs rely on nature, as it 
were, to supply the random element necessary for the application of probability 
theory, he believed, again from knowledge of field conditions, that the discrepan.- 
cies between the mathematical model and the true fertility pattern would not 
seriously vitiate the estimate of error. How far he would have carried the advo- 
cacy of systematic arrangements is not clear. Nearly all of his discussions refer 
to arrangements for comparing only two varieties. Moreover, he stressed re- 
peatedly the necessity for carrying out such trials over several seasons and at a 
considerable number of places, in order to sample weather and soil variations, 
In such cases the statistical significance of a result at a single place did not interest 
Student greatly. 

A few quotations of his opinions on more detailed questions illustrate his under- 
standing of the attitude of the farmer and agronomist and his direct mode of 
expression. Speaking of experimentation with large plots, he says “‘it has the 
advantage that the farmer, who always has a healthy contempt for gardening, 
may pay some attention to the results.”” On adjustments to yield data to allow 
for variations in soil fertility, he writes: ‘‘There is a great disadvantage in cor- 
recting any figures for position, inasmuch as it savors of cooking, and besides the 
corrected figures do not represent anything real.’”? Long experience had made 
him more and more convinced that a serial correlation is inevitable in routine 
chemical analyses; accordingly, he advises: ‘““The chemist who wishes to impress 
his clients will therefore arrange to do repetition analyses as nearly as possible at 
the same time, but if he wishes to diminish his real error he will separate them by 
as wide an interval of time as possible.” 

Re-reading of Student’s papers has been a keen pleasure. It is hoped that the 
volume will enjoy a wide circulation amongst statisticians. 

W. G. CocHRaNn 





NEWS AND NOTICES 
Readers are invited to submit to the Secretary of the Institute news items of interest 
Personal Items 


Dr. P. H. Anderson, formerly of War Production Board in Cleveland, is now 
associated with the Bureau of Foreign and Domestic Commerce in Washington, 
D. C. 

Dr. T. A. Bancroft, formerly instructor at Iowa State College, has been pro- 
moted to an assistant professorship. 

Jarvis Barnes is now Supervisor of Statistics for the Atlanta Board of Educa- 
tion. 

Associate Professor W. D. Baten has been promoted to the rank of Professor 
of Mathematics and Research Professor at the Michigan Agricultural Experi- 
ment Station of Michigan State College. 

Isaac L. Battin is now Assistant Professor of Mathematics at Drew Uni- 
versity. 

Robert E. Bechhofer has been promoted from Assistant Statistician to Associ- 
ate Statistician at Aberdeen Proving Ground. 

Blair M. Bennett is now Associate Mathematician with the National Bureau 
of Standards at Washington. 

Assistant Professor Thomas A. Bickerstaff on leave from the University of 
Mississippi, is in attendance at the University of Michigan. 

Dr. Ernest E. Blanche, formerly with Curtiss-Wright, is now Principal Statisti- 
cian, Chief of Operations with Statistics Division, Foreign Economic Administra- 
tion. 

Assistant Professor Frances L. Campbell, on leave from her position as Dean 
of Women at George Pepperdine College, is studying at the University of 
Michigan. 

Professor Haskell B. Curry, on leave from Pennsylvania State College, is now 
Mathematician at the Applied Physics Laboratory, Johns Hopkins Univer- 
sity. 

Dr. A. H. Copeland of the University of Michigan has been promoted to a 
professorship. 

Dr. Gerald J. Cox, formerly with the National Research Council, is now 
Research Chemist with Corn Products Refining Company, Argo, Illinois. 

Assistant Professor Paul Dressel of Michigan State College has been appointed 
Chairman of the Board of Examiners and Director of Counselling. 

William F. Elkin has accepted a position as Social Science Analyst, Vital 
Statistics Division, Bureau of the Census at Washington. 

Mark W. Eudey, on leave from the University of California, is serving as 
Operations Analyst with the Ninth Tactical Air Command. 
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Associate Professor Herbert P. Evans has been promoted to the rank of Pro. 
fessor of Mathematics at the University of Wisconsin. 

Raymond H. Fadner, formerly with the OCD, is now Statistician for the 
United Nations Relief and Rehabilitation Administration. 

Dr. Nicholas A. Fattu is now with the College Entrance Examination Board 
at Princeton. 

Evelyn Fix is now Lecturer in Mathematics and Research Assistant at the 
Statistics Laboratory of the University of California. 

Assistant Professor Harold A. Freeman has been promoted to the rank of As- 
sociate Professor of Statistics at Massachusetts Institute of Technology. 

Milton Friedman, formerly with the Treasury Department, is now with the 
Columbia University Statistical Research Group. 

E. L. Godfrey is now instructor in Mathematics and Physics at the Northwest 
Missouri State Teachers College at Maryville. 

Harry H. Goode, formerly Statistician with the New York Department of 
Health, is now Research Associate in Mathematics at Tufts College. 

Mrs. Dorothy K. Gottfried, formerly Assistant Statistician with the OPA, 
is now Research Analyst with the United States Army Signal Corps at Newark. 

Trygve Haavelmo, formerly with the Norwegian Shipping and Trade Mission, 
is now Commercial Secretary at the Norwegian Embassy at Washington. 

M. H. Henry, formerly Assistant Statistician with the Michigan Department 
of Social Welfare, is now Cost Analyst for the Fisher Body Corporation at 
Detroit. 

Dr. Robert W. B. Jackson has been promoted to the rank of Assistant Profes- 
sor and Assistant Director of the Department of Educational Research at the 
Ontario College of Education. 

Dr. Rachel M. Jenss, formerly Economic Analyst with the Treasury Depart- 
ment, is now with the State Department in the capacity of Assistant Chief, Cost 
of Living Division, Foreign Service Administration. 

Dr. Irving Kaplansky is Assistant Research Mathematician at Columbia 
University. 

Leo Katz, formerly Statistician with the Michigan Department of Labor and 
Industry, is now with the Aircraft Development Section of General Motors 
Corporation at Detroit. 

Lila F. Knudsen is now Statistician with the Food and Drug Administration. 

Dr. Richard L. Kozelka has been promoted to the rank of Professor of Eco- 
nomics and Statistics and Acting Dean, School of Business Administration at the 
University of Minnesota. 

Dr. George M. Kuznets has been promoted to the rank of Assistant Professor 
of Agricultural Economics at the University of California. 

Max LeLeiko, formerly at the University of North Carolina, is now an in- 
structor at Rutgers University. 

Assistant Professor M. 8. Macphail of Acadia University has been promoted 
to an associate professorship. 
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Dr. William G. Madow, formerly with the Bureau of Agricultural Economics, 
is now Statistician with the Census Bureau. 

Dr. Benjamin Malzberg is now Director of the Bureau of Statistics, New York 
State Department of Mental Hygiene. 

Dr. Henry B. Mann has taken a position as Research Associate at Ohio State 
University. 

Dr. John W. Mauchly has been promoted to the rank of Assistant Professor 
of Electrical Engineering at the University of Pennsylvania. 

Dr. R. v. Mises of Harvard University has been elected to membership in the 
American Academy of Arts and Sciences. 

Frederick C. Mosteller is now Research Associate at Princeton University. 

Franklin 8. Nelson has been promoted from Assistant Statistician to Associate 
Statistician at Picatinny Arsenal. 

Dr. Edward B. Olds, formerly Statistician with Curtiss-Wright, is now Eco- 
nomic Analyst with the United Nations Relief and Rehabilitation Administra- 
tion. 

Assistant Professor Charles K. Payne has been promoted to an associate pro- 
fessorship of mathematics at New York University. 

Paul Peach, formerly with United States Rubber Company, is now Quality 
Control Consultant with the War Production Board. 

Associate Professor Phillip J. Rulon has been promoted to the rank of Pro- 
fessor of Education and Acting Dean of the Harvard Graduate School of Educa- 
tion. 

Assistant Professor Paul A. Samuelson has been promoted to the rank of 
Associate Professor of Economics at Massachusetts Institute of Technology. 

Miss Marion M. Sandomire, formerly with the War Department, is now 
Statistician with the Navy Department, Bureau of Ships. 

William C. Shelton, formerly with the Department of Commerce, is now 
Senior Mathematical Statistician with the Navy Department. 

Blanche Skalak, formerly connected with Wilson Mechanical Instrument 
Company, is now Statistician with the Census Bureau. 

Associate Professor L. Edwin Smart has been promoted to the rank of Professor 
of Economics at Ohio State University. 

Dr. L. V. Toralballa, who has been a special instructor at the University of 
Michigan during the past year, is now at Princeton University for post-doctoral 
study. 

Assistant Professor Abraham Wald has been promoted to Associate Professor 
of Statistics at Columbia University. 

John F. Wyckoff has been promoted to Assistant Professor of Mathematics at 
Trinity College, Hartford. 

Ruth Zwerling, formerly with the New York City Employees Retirement 
system, is now Research Assistant with the Milbank Memorial Fund. 
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New Members 


The following persons have been elected to membership in the Institute: 


Budne, Thomas A. M.A. (State Teachers Coll. Montclair) Asst. Prod. Mgr., Atlas Air- 
craft Prod. Corp., 405 E. 45 St., New York, N. Y. 4765 Jersey St., Staten Island, N. Y, 

Darkow, Marguerite D. Ph.D. (Chicago) Asso. Prof. of Math., Hunter Coll., New York, 
N.Y. 16 E. 82 St., New York, 28, N. Y. 

Evers, Dillon Ph.D. (Iowa) Captain, Ord. Dept., Asst. Army Inspector of Ordnance, St, 
Louis Ord. Plant, 4300 Goodfellow Blvd., St. Louis, 20, Mo. 

Hintermaier, John C. Grad. (Phil. Textile Inst.) Supervisory Chem. Forstmann Woolen 
Co., 2 Barbour Ave., Passaic, N. J. 21 Marilyn Place, Clifton, N. J. 

Katz, Amron H. B.A. (Wisconsin) Asso. Physicist, Chief, Shutter Unit, Aerial Photo 
Lab., TSELA-4D, Wright Field, Dayton, Ohio. 

Lavin, MarvinM. M.S. (Chicago) Stat. Corning Glass Works. 80 E. Second St., Corning, 
New York. 

Levine, Harriet A.B. (Hunter) Asst. Math. Stat., Stat. Res. Group, Columbia Univer- 
sity, New York, N. Y. 309 W. 99th St., New York, 25, N. Y. 

Littauer, Sebastian B. D.Sc. (Mass. Inst. Tech.) Res. Eng., Bendix Radio Div., Bendix 
Aviation Corp., Towson, Md. Box 6797, Towson, 4, Md. 

Pastore, Nicholas M.S. (C.C. N.Y.) Teacher, St. John’s Prep. School, Brooklyn, N. Y. 
2944 Radcliff Ave., New York, N.Y. 

Rodrigues, Milton da Silva Dr. (Brazil) Prof. of Stat., Univ. of Sao Paulo, Brazil. Jnter- 
national House, Rm. 323, 500 Riverside Drive, New York, 27, N. Y. , 

Rothbard, Murray Senior (Columbia) 370 Central Park West, New York, N. Y. 

Rubin, Ernest M.A. (Columbia) Res. Analyst, U. S. Dept. of Justice, Immigration & 
Nat. Service, 7104 Emerson Ave., Upper Darby, Pa. 

Sousa, Alvaro Pedrode B.E. (Liverpool) Vice-Governor, Banco de Portugal, Monserrate, 
Rua Infante de Sagres, Estoril, Portugal. 

Wirick, Grover C., Jr. A.B. (Mich State) Sec. Stat. Res. Dept., Milwaukee Community 
Fund & Counc. of Soc. Agencies. 3205 W. Wisconsin Ave., #*C-8, Milwaukee, 8, Wis. 


Report on the Third Meeting of the Pittsburgh Chapter of the Institute 


The third meeting of the Pittsburgh Chapter of the Institute of Mathematical 
Statistics was held at Carnegie Union, Carnegie Institute of Technology, on 
Saturday, March 18, 1944. Twenty-five persons attended the meeting, includ- 
ing the following four members of the Institute: 

G. G. Eldredge, H. J. Hand, F. G. Norris and E. M. Schrock. 

At the morning session, Mr. E. M. Schrock, Aberdeen Proving Ground, Aber- 
deen, Md., presented a paper ‘‘SSome Fundamental Aspects of the Shewhart 
Quality Control Chart”. At the afternoon session, Mr. W. H. Thomas, National 
Tube Company Research Laboratory, presented an address ‘‘Analysis of Variance 
and Experimental Design in Metallurgical Research”. Mr. F. G. Norris, 
President of the Pittsburgh Chapter, acted as chairman. 

H. J. Hann, 
Secretary-Treasurer of the Pittsburgh Chapter. 


Report on the Fourth Meeting of the Pittsburgh Chapter of the Institute 


The fourth meeting of the Pittsburgh Chapter of the Institute of Mathematical 
Statistics was held at Carnegie Union, Carnegie Institute of Technology on 





NEWS AND NOTICES 443 


Monday evening, October 2, 1944. Twenty-four persons attended the meeting, 
including the following four members of the Institute: 

J. V. Sturtevant, J. Manuele, R. F. Passano, H. J. Hand. 

Lieut. J. H. Curtiss, USNR, Bureau of Ships, Washington, D. C., presented 
a paper on the ‘Rational Sampling in the Preparation of Specifications”’. 


R. F. Passano, Bethlehem Steel Company, Bethlehem, Pennsylvania, acted as 
chairman for the meeting. 


H. J. Hann, 
Secretary-Treasurer of the Pittsburgh Chapter. 





DIRECTORY OF THE INSTITUTE OF MATHEMATICAL STATISTICS! 
(As of November 1, 1944) 


(The names of Fellows of the Institute are designated by *) 


Acerboni, Prof. Argentino V. Dr. Ec. (Buenos Aires) Facultad de C. Economicas, Buenos 
Aires, Argentina. Larroque 232, Banfield, Argentina. 

Allen, Roy G. D.Sc. (London) Combined Production and Resources Board, Soc. Se- 

curity Bldg., Washington, D.C. 1306 Shepherd St., N.W., Washington, 11, D.C. 

Alt, Franz L. Ph.D. (Vienna) U.S. Army. 271 Fort Washington Ave., New York, 32, 
Mud 

Alter, Dinsmore Col., CAC, c/o A.T.S., Fort Mason, San Francisco, Calif. 

Anderson, Paul H. Ph.D. (Illinois) Stat. Analyst, Distribution Mgt. Unit, Bur. of 
Foreign & Domestic Com., Washington, 25, D. C. 

Anderson, Richard L. Ph.D. (Iowa State Coll.) Res. Math., Princeton Univ., Fine Hall, 
Princeton, N. J. 

Anderson, Theodore W., Jr. M.A. (Princeton) Res. Math., Princeton Univ., Fine Hall, 
Princeton, N. J. . 

Andrews, T. Gaylord. Ph.D. (Nebraska) Instr. Barnard College, Columbia Univ., New 
York, N. Y. 

Angell, Dorothy T. Stat. Analyst, Bell Tel. Labs., Murray Hill, N. J. 

Arias, B., Jorge C.E. (Guatemala) 3 Avenida Sur 65, Guatemala City, Guatemala, 
Central America. 

Arnold, Asso. Prof. Herbert E. Ph.D. (Yale) Wesleyan Univ., Middletown, Conn. 

Arnold, Kenneth J. Ph.D. (Mass. Inst. Tech.) Sen. Math. Stat., Stat. Res. Group, 
Columbia Univ., 401 W. 118th St., New York, 27, N. Y. 

Aroian, Leo A. Ph.D. (Michigan) Instr. Hunter Coll., New Yerk, N. Y. 247 Wadsworth 
Ave., New York, 33, N. Y. 

Arrow, Kenneth J. M.A. (Columbia) 1st Lieut., A.C. Weather Div., Ac/As, O.C. & R., 
Hdatrs. AAF, Pentagon Bldg., Washington, D. C. 1416 Nicholson St., N.W., Wash- 
ington, 11, D.C. 

Bachelor, Robert W. M.B.A. (Washington) American Bankers Assn., 22 East 40th St., 
New York, 16, N. Y. 

Bacon, Asso. Prof. Harold M. Ph.D. (Stanford) Stanford Univ., Stanford, Calif. Boz 
1114. 

Bailey, Arthur L. B.S. (Michigan) Stat., American Mutual Alliance, 60 E. 42nd St., 
New York, N. Y. P.O. Box 278, Ramsey, N. J. 

*Baker, Asst. Prof. George A. Ph.D. (Illinois) Asst. Prof. of Math. and Asst. Stat. in 
the Exp. Sta., College of Agriculture, Univ. of California, Davis, Calif. 

Baldwin, Woodson W. S.B. (Mass. Inst. Tech.) Capt., Ord. Dept., U. S. A., Office of 
Field Dir. of Ammunition Plants, 3629 Lindell Blvd., St. Louis, 8, Mo. 3744 Lindell 
Blvd. 

Bancroft, Asst. Prof. Theodore A. Ph.D. (Iowa State Coll.) Iowa State Coll., Math. 
Dept., Ames, Iowa. . 

Barnes, Jarvis M.A. (George Peabody Coll. for Teachers) Atlanta Board of Educ. 14th 
Floor, City Hall, Atlanta, Ga. 


1 Members were asked to supply fresh information for this Directory. Records may be 
inexact or incomplete because of failure of some members to comply with this request. 
Changes in addresses, or errors in names, titles or addresses, should be reported to the Secre- 
tary. 
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Barnes, JohnL. Ph.D. (Princeton) Tech. Staff, Bell Tel. Labs., 463 West St., New York, 
N. Y. 19 E. River Rd., Rumson, N. J. 

Barr, Prof. ArvilS. Ph.D. (Wisconsin) Univ. of Wisconsin, Madison, Wis. 

Barral-Souto, Prof. José Sc.D. (Buenos Aires) Univ. of Buenos Aires, Buenos Aires, 
Argentina. Cordoba 1459. 

*Bartky, Asso. Dean Walter Ph.D. (Chicago) Univ. of Chicago, Chicago, IIl. 

Bassford, Horace R. B.A. (Trinity Coll.) Vice Pres. and Actuary, Metropolitan Life 
Ins. Co., 1 Madison Ave., New York, 10, N. Y. 

*Baten, Prof. William D. Ph.D. (Michigan) Prof. of Math. Mich. State Coll. and Res. 
Prof. Mich. Agri. Exp. Sta., Mich. State Coll., E. Lansing, Mich. 411 Marshall St., 
E. Lansing, Mich. p 

Bates, Prof. O. Kenneth Sc.D. (Mass. Inst. Tech.) Prof. of Math. and Head of Dept., 
The St. Lawrence Univ., Canton, N. Y. 

Battin, Asst. Prof. Isaac L. A.M. (Swarthmore) Drew Univ., Madison, N. J. 14 Glen- 
wild Rd., Madison, N. J. 

Beall, Geoffrey Ph.D. (London) Asst. Entomologist, Dominion Entomological Lab., 
Chatham, Ont., Can. : 

Bechhofer, Robert E. A.B. (Columbia) Asso. Stat. (Ordnance) Ordnance Res. & 
Development Center, Arms & Ammunition Div., Aberdeen Proving Gd., Md. Boz 
21, Aberdeen, Md. 

Becker, Harold W. Elec. Inst., Mare Is. Training School, Bldg. 146, Mare Island, Calif. 
1426 Amador, Vallejo, Calif. 

Beebe, Lieut. Gilbert W. Ph.D. (Columbia) Sn. C., A. U. 8. Control Div., Office of the 
Surgeon General, 1818 H St., N.W. Washington, D. C. 

Been, Richard O. M.A. (George Washington) Sr. Agric. Econ., U. S. Bur. of Agric. 
Economics, 3433 South Bldg., Washington, D. C. 

Bellison, Harold R. S.B.,S.M. (Mass. Inst. Tech.) Industria] Eng., War Dept., Ord. 
Dept., Pentagon Bldg., Arlington, Va. 3416 B St., S.E., Washington, 19, D.C. 
Belz, Asso. Prof. Maurice H. M.A. (Melbourne) Univ. of Melbourne, Carlton, N. 3, 

Victoria, Australia. . 

Bennett, Prof. Albert A. Ph.D. (Princeton) Brown Univ., Providence, R. I.; Maj., 
Ord. Dept., Ballistics Res. Lab., Aberdeen Proving Gd., Md. 

Bennett, Blair, M. M.A. (Columbia) Asso. Math. Nat. Bur. of Standards, Washington, 
D.C. 1410 M St., N.W., Washington, D.C. 

Bennett, Carl A. A.M. (Michigan) Stat. General Del. Oak Ridge, Tenn. 

Berger, Richard M.A. (Columbia) Ensign, U. S. Navy, 24 Rugby Rd., Rockville Centre, 
x. Y¥. 

Berkson, Joseph M.D., D.Sc. (Johns Hopkins) Col. M.C., U.S. Army, Army Air Forces, 
Washington, D. C. 

Bickerstaff, Asst. Prof. Thomas A. M.A. (Mississippi) 709 Haven St., Ann Arbor, Mich. 

Birnbaum, Asst. Prof. Z. William Ph.D. (Lwow) Univ. of Washington, Seattle, Wash. 

Blackadar, Walter. L. B.A. (McMaster) Asso. Actuary, Equitable Life Assurance 
Society of the U.S. 393 7th Ave., New York,1, N.Y. 

Blackburn, Asso. Prof. Raymond F. Ph.D. (Pittsburgh) Head, Dept. of Stat., Univ. 
of Pittsburgh, Pittsburgh, 13, Pa. 

Blake, Archie Ph.D. (Chicago) Eng. Natl. Inventors Council, Washington, D.C. 2812 
Cathedral Ave., N.W., Washington, D.C. 

Blanche, Ernest E. Ph.D. (Illinois) Prin. Stat., Chief of Operations, Foreign Econ. 
Admin., Stat. Div., 515-22nd St., N. W., Washington, D. C. 3228 Martha Custis 
Drive, Parkfairfax, Alexandria, Va. 

*Bliss, Chester I. Ph.D. (Columbia) Biometrician, Conn. Agri. Exp. Sta.; Lecturer in 
Biometry (Asso. Prof.), Yale Univ., New Haven, Conn. Boz 1106. 


Bloom, Rose B.A. (Hunter Coll.) Private, 1263 SCSU, Mason GH, Brentwood, Long 
Island, N. Y. 
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Bonis, Austin J. B.S. (C.C.N.Y.) Major, G-I War Dept. Gen. Staff, Washington, D. ¢, 
2500 Que St., N.W., Washington, D. C. 

Bonnar, Robert U. M.S. (Washington) Asso. Chemist, Bur. of Ships, Navy Dept., Wash- 
ington, D.C. 414 Whitestone Rd., Silver Spring, Md. 
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